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PREFACE 


This book is written for the student who is just beginning a serious 
study of mathematics but who wishes to become acquainted as quickly 
as possible with the interesting and powerful ideas that have flowered in 
this century. The main prerequisites are a strong desire to learn and a 
willingness to think energetically. No more mathematical background 
than standard high-school courses in plane geometry and elementary 
algebra is assumed. Additional experience, mathematical or not, is helpful 
primarily because of its contribution to maturity and motivation. The 
purpose of the book is, briefly, to help the student obtain a modest mathe- 
matical literacy. By energetic study he may expect to acquire from it a 
minimal orientation in modern mathematics, a basic vocabulary of mathe- 
matical terms, and some facility with the use of mathematical concepts 
and symbols. He should then be ready to study mathematical statistics 
and to take further work in mathematics, and he should be equipped with 
the mathematical tools most essential in the physical sciences, engineering, 
the biological sciences, and the social sciences. 

The beginner in mathematics may be compared with someone about 
to take up a foreign language. He may wish merely to learn to speak, read, 
and understand the simplest phrases, in short, just enough “to order 
breakfast.” There are several books directed toward the student who 
wants only this so-called “practical” acquaintance with elementary math- 
ematics. On the other hand, the student of a language may wish to be 
able to carry on an intelligent conversation, to read significant material, 
to write, and even perhaps to think in the language. He must be interested 
in grammar, syntax, and, above all, in acquiring an understanding and a 
“feel” for the language. Such a student of mathematics acquires manip- 
ulative skill as a by-product of striving for insight and thorough knowl- 
edge. It is for this kind of student that the present book has been written. - 

The traditional college curriculum in mathematics is largely an ossifica- 
tion of the program designed to meet the needs of engineers before World 
War I. Since that time mathematics has developed as never before in its 
history. Altogether new branches have grown up, and new light has been 
shed on old ones. Mathematical methods have been applied to new fields, 
and unexpected developments in science and technology have required 
the use of novel mathematical theories. The old curriculum no longer 
meets our needs in the training of scientists, engineers, and mathematicians. 

The inadequacy of the traditional mathematics curriculum is most 
obvious in relation to the needs of the social scientist. Today there is 
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hardly a branch of mathematics that does not find application in eco- 
nomics and psychology. Since 1930, when the Social Science Research 
Council appointed a committee to report on college mathematics, there 
has been considerable study, experimentation, and discussion. There 
seems to be fairly general agreement that social scientists need a mathe- 
matical curriculum that brings them quickly into contact with modern 
mathematics with the omission of traditional topics that no longer con- 
tribute either to applications or understanding. - 

It might seem that the increasing needs of the social scientist would 
make it desirable to plan different introductory courses for social and 
natural scientists. However, it turns out that the topics considered de- 
sirable by the social scientist are exactly those that are fundamental for 
all users of mathematics, for the educated citizen, and for the prospective 
mathematician.* The proposals for curricular change made by social 
scientists and psychologists are very similar to those proposed by com- 
mittees of mathematicians who have recently considered the high-school 
and elementary college programs for all students. This is not really sur- 
prising, because the basic characteristic of mathematics 15 its universality 
of application. Therefore it is natural that ideas of fundamental im- 
portance in one field of application and in mathematics itself should turn 
out to be also of fundamental importance elsewhere. 

This book has developed out of a number of years of experimentation 
with elementary mathematics courses at Carleton College. Experience 
proved that students of all degrees of ability and interest were able to 
understand, enjoy, and apply supposedly very “abstract” and “difficult” 
ideas of modern mathematics. A preliminary edition has been used at 
Carleton and by a number of professors in different parts of the country. 
The present book is a completely rewritten version incorporating major 
and minor revisions based on classroom experience and a number of de- 
tailed critiques. 

To achieve the aim of moving rapidly to significant modern material 
without getting bogged down in traditional detail, many innovations have 
been necessary. Among these are: (a) The symbolism of logic and set 
theory is presented early and used throughout the book. (b) A simple 
notation indicating substitution for variables in formulas is adopted. 
(c) Exercises are inserted in the body of the sections in order to require 
reader participation. Answers are given at the end of each section. (d) Ap- 
plications are to a very wide field in the humanities, arts, biology, and 
social sciences, as well as to the physical sciences and engineering. (e) 
There is an intentional and carefully explained variation in rigor, the 

* See, for example, “Mathematics for Social Scientists,” by R. R. Bush, 


W. G. Madow, Howard Raiffa, and R. M. Thrall, American Mathematical 
Monthly, October, 1954. 
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greatest rigor being applied to simpler topics. (f) There are numerous 
departures from traditional order where this makes for greater brevity . 
and clarity. I am convinced that there remain many unexplored possi- 
bilities for further simplifying elementary mathematics. 

The mathematician who casually picks up this book and glances at the 
first pages may get the impression that it is a textbook on foundations of 
mathematics. This is not the case. The material on foundations, logic, 
and sets has mathematical importance in itself, but it is presented at the 
beginning of the book as a pedagogical device to speed up and make more 
᾿ efficient the student’s acquisition of mathematical understanding and skill. 
In the first three chapters, of the twenty-nine sections not used for in- 
troducing a chapter and not marked for possible complete omission, 
seventeen are devoted entirely to review of algebra. In the others, much 
of the content consists of traditional elementary mathematics, and many 
of the details relating to logic and set theory may be omitted by the 
teacher who wishes to move forward quickly to other topics in algebra 
and analysis. This is not a textbook on logic. It is a textbook that uses 
logic to attack elementary mathematics. In doing so it follows the tradi- 
tion founded by Euclid, whose treatise on geometry used logic as a 
pedagogical and organizational device. 

In the preliminary edition, applications were concentrated in the social 
sciences. The present book still contains social science applications, but 
there is now a balance between applications in the physical and social 
sciences. A careful check has been made to be sure that the student will 
be supplied with every technique needed in elementary courses in physics, 
chemistry, and engineering. For example, every mathematical technique 
required for the study of University Physics by Sears and Zemansky has 
been included, often by requiring the student to work out the very manipu- 
ations and results he will meet when reading that text. 

Perhaps the title, Elements of Modern Mathematics, calls for some ex- 
planation. In the 20th century mathematics has been characterized by 
an explosive creativity, rapid expansion of its fields of application, an in- 
creasing use of abstract theories, and a tendency toward unification in 
terms of the concepts of set theory. “Modern” has come into use as a 
catchword to describe mathematical teaching that reflects these features, 
and it is used in this sense in the title. It does not signify that everything 
of old vintage has been rejected, but rather the belief that everything, of 
whatever age, should be reconsidered in the light of our current knowledge 
of mathematics and its applications. I have no doubt that this book 
represents only one step along the path of fully modernizing the mathe- ἡ 
matics curriculum. 

The help of many colleagues and students was acknowledged in the 
preliminary edition. However, I wish to acknowledge again the influence 
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and encouragement of the pioneering texts of Moses Richardson, the 
- College Mathematics Staff of the University of Chicago, Carl Menger, 
E. Begle, and F. L. Griffin. The following have made helpful suggestions 
for revision of the preliminary edition: W. J. Baumol, E. J. Cogan, C. H. 
Dowker, 5. Dreyer, John Dyer-Bennet, W. L. Garrison, H. Gulliksen, 
¥. Harary, H. Howden, 5. P. Hughart, N. L. Jacobson, M. Kline, R. P. 
May, H. 8. Moredock, J. Nystuen, M. W. Oliphant, E. Pimsler, C. Pog- 
genburg, J. Roth, D. Schier, J. Shear, R. Sloan, R. Stewart, D. M. Stowe, 
G. R. Trimble, A. W. Tucker, 5. Weintraub, and F. L. Wolf. I am also 
indebted to several anonymous reviewers and to the staff of Addison- 
Wesley Publishing Company. I hope that readers of this book will not 
hesitate to make corrections, criticisms, and suggestions. 

K. O. M. 
Northfield, Minnesota 
January, 1959 


NOTE TO THE STUDENT 


On the first page of his How to Play the Chess Openings, Znosko-Borovsky 
writes, “It is not, then, by memorizing variations that we shall become 
proficient in playing openings, but by understanding their meaning, their 
purpose, and the general ideas and principles which are their foundations.” 
And on the very last page, he repeats: “Do not trust to your memory or 
learn variations by heart. Learn to pick out the directive ideas of an 
opening for yourself, and above all remember that you alone are the 
creator of your game, and your task starts at the very first move.” This 
excellent advice is as appropriate to mathematics as it is to chess. If you 
wish to study mathematics efficiently, you must center your work on 
understanding fundamentals and on doing your own thinking. Of course, 
you have to memorize some definitions and rules of procedure (just as you 
must learn the moves in chess), but forgotten facts can easily be looked 
up and those that are used frequently will naturally come to be remem- 
bered. Understanding, however, can neither be looked up nor forgotten. 

According to the dictionary, to understand something is to be thor- 
oughly familiar with it, to visualize it clearly, to be able to explain, jus- 
tify, and use it. How, then, can you come to understand the mathematics 
in this book? Only by working with it! Mathematics cannot be mastered 
passively, by merely reading. Reading is only the first step, to be followed 
immediately by your own activity. Rephrase what you read in your own 
words. Illustrate it by examples. Draw a picture or diagram suggested 
by it. Try to find an example that contradicts it. Relate it to previous 
discussion. Try to justify it. Try to prove it wrong. Experiment with 
variations. Try to generalize and specialize its meaning. Analyze its 
form and parts. Explain it to yourself. As Lewis Carroll wrote in the in- 
troduction to his Symbolic Logic, “One can explain things so clearly to 
one’s self! And then, you know, one is so patient with one’s self: one 
never gets irritated at one’s own stupidity!” Of course, some of what you 
read will not deserve such detailed treatment, but many parts of this book, 
especially the formal theory, compress a tremendous amount of informa- 
tion into a very small space. These can be understood only by extensive 
activity on your part. Your most important tools in this work are a pencil, 
plenty of paper, and a large wastebasket. Exercises are inserted in the 
body of the text to help you take an active part. These exercises must be 
done as you read, since they are essential to the descussion. 

Unfortunately, mathematics is often taught as a mere matter of learn- 
ing instructions and following them. You may have the habit of not try- 
ing anything in mathematics until you already “know what to do,” until 
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you feel that you “understand” the problem. If this is your habit, try to 
overcome it as quickly as possible, for it will be the main obstacle to your 
progress. In order to learn to understand, in order to develop the ability 
to find out what to do, you must go boldly ahead. If you are not sure what 
to do with a problem, experiment and see what happens! If the results are 
not satisfactory, try something else. Experiment, record your work, 
think about your results, and experiment again. In this way, and only 
in this way, will you develop the confidence to think independently in 
mathematics. 

If in your reading or your efforts to solve problems you are unable to 
get satisfactory results in spite of numerous experiments and hard work, 
review previous parts of the book, or do something else with the intention 
of returning to the troublesome question later. Often a change of line of 
thought, or even a complete rest, will accomplish more than hard work. 
You cannot understand everything at once, and there is no need to give 
up because some items are obscure. In the words of one of the best-known 
living mathematicians, Norbert Wiener, “It is not essential for the value 
of an education that every idea be understood at the time of its accession. 
Any person with a genuine intellectual interest and a wealth of intellectual 
content acquires much that he only gradually comes to understand fully 
in the light of its correlation with other related ideas. ... Scholarship is a 
progressive process, and it is the art of so connecting and recombining 
individual items of learning by the force of one’s whole character and 
experience that nothing is left in isolation, and each idea becomes a com- 
mentary on many others.” 

kK. O. M. 
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NOTE TO THE TEACHER 


Although this book is tightly structured, it may be adapted to a wide 
variety of courses, provided only that the teacher wishes to make use of 
the simplest ideas of logic and sets. Many sections (marked with a star) 
may be omitted altogether without disturbing the continuity. Others 
may be skipped if minor adjustments are made later. Considerable varia- 
tion is possible by choice of topics and problems within the sections. 
Problems are graded from the easiest exercises in the body of the sections 
to really difficult problems toward the end of the list following each sec- 
tion. Answers to exercises and to selected problems are given at the end 
of each section. Many problems are paired, one that is answered being 
followed by a similar one without an answer. There are more problems 
than can be done in the time usually available, so that students and 
teachers can choose those appropriate to their interests. 

For those who wish to reach as quickly as possible the material on 
analysis beginning in Chapter 4, the number of days spent on logic and 
sets may be reduced to a total of six. Probably, even for the students with 
strongest preparation in algebra, another seven class periods should be 
spent on algebra review in the first three chapters. For students with weak 
backgrounds in algebra, the time spent on review should perhaps be 
doubled. This suggests that Chapter 4 could be reached in a minimum of 
eight assignments spent on logic, sets, and other basic ideas, that a feasi- 
ble schedule for well-prepared students calls for thirteen days, and that a 
total of twenty-one days or more would be required for those with weak 
preparation in algebra. The following are schedules for thirteen and 
twenty-one days. 


1. 1-1 through 1-4 ὃ. 2-1, 2-2, 2-3 9. 3-1, 3-2 
2. 1-5 through 1-10 6. 2-5, 2-6, 2-7 10. 3-3,3-4 Ὁ 
3. 1-11 through 1-15 7. 2-8, 2-9 11. 3-5 
4, 1-16, 1-17 8. 2-10 12. 3-6 

13. 3-9 
1. 1-1, 1-2, 1-3 8. 1-14 15. 2-9 
2. 1-4 9. 1-15 16. 2-10 
3. 1-5, 1-6, 1-7 10. 1-16, 1-17 17. 3-1, 3-2 
4. 1-8, 1-9 11. 2-1, 2-2 18. 3-3, 3-4 
5. 1-10, 1-11 12. 2-3 19. 3-5 
6. 1-12 13. 2-5, 2-6, 2-7 20. 3-6 
fe 14. 2-8 21. 3-9 
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For engineering courses in which it may be desirable to reach some 
calculus (Chapter 7) early in the first semester, either of the two plans 
above could be followed by a schedule such as this: 


4 


14a, 49 9. 7-3 
2. 4-8 10. 7-4 
3, 4-5 11. 7-5 
4. 5-1, 5-2, 5-3 12. 7-6 
5. 5-4 13. 5-9 
6. 5-7 147-7 
7. 5-8 15. 5-10 
8 


. 7-1, 7.2 16. 7-8 


Before continuing with Chapter 7, the class should cover the omitted 
sections in Chapters 4, 5, and 6 (except the starred optional sections). 
Variations are possible. Such accelerated schedules require concentration 
on essentials and occasional class references to omitted material. If slighted 
topics can be taken up again later in the year, advantages can be gained 
from double exposure and increased student maturity. 

For a year course stressing foundations and meeting three hours a week, 
the following is a possible schedule. (Parts of Chapter 7 could be included 
by cutting shorter the work in Chapters 6, 8, 9, or 10.) 


Chapter 1: 15 assignments Chapter 6: 12 assignments 
Chapter 2: 11 ἡ Chapter 8: 6 ‘ 
Chapter 3: 9 ᾿ Chapter 9: 8 . 
Chapter 4: 10 ᾿ Chapter 10: 7 7 
Chapter 5: 11 ἡ 


A year course concentrating on analysis can be based on very thorough 
caverage of the first seven chapters. If the students are strong in algebra, 
time not needed for review can be used for selections from Chapters 6, 
8, 9, and 10. Chapter 7 can occupy from fifteen to thirty class hours, 
depending on the emphasis and thoroughness of coverage. 

For a one-semester course to be followed by a course in analytics and 
calculus, Chapters 1, 2, 3, 4, and 5, and selections from Chapters 6, 8, 9, 
and 10 would be appropriate. 

Any of these courses could, by choice of problems and topics within 
sections, be adapted to terminal or introductory courses, to special courses 
for teachers, social scientists, or engineers, and to varying levels of student 
ability. 

K. O. M. 


CHAPTER 1 
ELEMENTARY ALGEBRA 


1-1 The purpose of this chapter. Fruitful discussion requires some 
common knowledge and understanding. Unless we agree on the meaning 
of words and other symbols, we shall be like players in a game without 
rules, and just as likely to be confused. 

The task of creating a basis for mathematical discussion is simplified 
by the fact that mathematics can be grasped in terms of a very few pro- 
foundly simple ideas. This results in a beautiful unity of logical structure, 
and also makes mathematical knowledge much easier to acquire, organize, 
understand, and recall. It underlies the cumulative character of mathe- 
matics courses, where each step is based on those before, and the whole 
develops in a logical way. 

We take for granted an understanding of everyday language, a back- 
ground of experience such as may be assumed common to the reader and 
the author, and some memories of elementary geometry, algebra, and 
arithmetic. The purposes of this chapter are (1) to remind the reader of 
certain simple ideas associated with numbers, (2) to discuss informally the 
fundamental ideas used throughout the book, (3) to explain the meanings 
that we assign to certain words, and (4) to show how some laws of ele- 
mentary algebra may be derived from others. 

There is an old saying that “a hard beginning makes a good ending.” 
The reader will find the proverb confirmed if he attacks this chapter with 
energy. Of course, he must not expect to master immediately the new 
ideas presented here. They are among the most generative man has 
produced and can be appreciated fully only by meeting and using them 
Mm various forms. 


1-2 Numbers and points. Numbers may be visualized and applied in 
many ways. For example, a positive integer,* such as 1, 2, 3, and 37, may 
be thought of as the result of counting the members of some set of ob- 
jects. A ratio of positive integers, such as 1/3, 4/3, 2/12, and 117/29, 
may be visualized as indicating the number of fractional parts in some 
collection of parts of objects. Zero may be conceived as the number of 
members in a club from which all members have resigned. A negative 


* A word or phrase that is to be used in this book in a special technical sense 
is printed in italics in the sentence that defines it. Other words are used with 
meanings that are familiar to the reader or may be found from a dictionary. 
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number, such as —2, —4/3, —2.75, and —100, may be thought of as a 
temperature below zero or as the balance in an overdrawn bank account. 
The variety of interpretations and uses 1s endless. We concentrate here 
on one that gives us a simple geometric picture of any number. 

We imagine a straight line, as pictured in Fig. 1-1. On it we choose a 
point, call it the origin, and let it serve as the geometrical image of 
the number 0. We select another point (customarily to the right of the 
origin), call it the undt point, and let it correspond to the number 1. The 
direction from the origin to the unit point is the posetie direction; the op- 
posite direction is the negate direction; and the distance between the 
origin and the unit point is the undt distance. We call such a line an azis. 
The scale on a thermometer is a familiar example of a portion of an axis. 


Unit distance .. 


Origin | Ἶ | Unit point 
ee ee ee ee ee ΤΕ; 
—5 -,4 --ὃ -—2 —l 0 I 2 3 4 δ 


Negative direction Positive direction 


Figure 1-1 | 


Every point on an axis serves as the geometric image of a unique num- 
ber. We call the number the coordinate of the corresponding point, and 
the point the graph of the number. If the point is in the positive direction 
from the origin, this number is just the distance between the origin and 
point, measured in terms of the unit distance. If the point is in the nega- 
tive direction, the corresponding number is the negative of the distance 
between the point and the origin. For example, the number corresponding 
to the point 10 units to the left of the origin is —10. We now look more 
carefully at this correspondence between points and different kinds of 
numbers. | 

Positive integers. Yo find the point corresponding to a positive integer 
we lay out the unit distance to the right a number of times equal to the 
positive integer. | 

Integers. The positive integers, zero, and the negative integers are 
called integers. Every negative integer is the negative of some positive 
integer. For example, —6 is the negative of 6. And every positive integer 
is the negative of a negative integer. For example, 6 is the negative of 
—6, since —(—6) is the same as 6. To find the point corresponding to a 
negative integer, we lay out the unit distance to the left a number of times 
equal to the negative of the number. For example, to find the point cor- 
responding to —-6 we lay out the unit distance —(—6), or 6, times to the 
left. It is much easier to describe this by using a letter, say “x,” to stand 
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for any number. Then we can say that if x is a positive integer, we find 
the point corresponding to —z by laying out the unit distance x times 
to the left. 


*(a) Draw an axis and show the origin, the unit point, and the points corre- 
sponding to 6, —6, —2, and 7. Complete the following: (b) The points corre- 


sponding to x and —x are ________ from the origin but in opposite directions. 
(c) If x is a positive integer, the point z is to the _______ of the origin and the 
point —z is to 8. 7 but if x is a negative integer, the point 2 is to the 


of the origin and the point —z is to the 


Fational numbers. A number that is the ratio of two integers is called 
rateonal. Examples are 27/19, —1/2, 1.25, and 3. The last two are ra- 
tional because 1.25 is the same as 125/100 and 3 is the same as 6/2. To 
find the point corresponding to 1/3 we simply divide the unit distance into 
three equal parts and lay out a distance equal to one of them. To locate 
7/3 we lay out the segment of length 1/3 seven times. More generally, if 
x and y are positive integers, we locate the point x/y by dividing the unit 
segment into y equal parts and laying out one of these parts x times to 
the right of the origin. If x and y are positive, we locate —x/y by per- 
forming the same operation toward the left (see Fig. 1-2). Since an 
expression such as “5/0” is meaningless, a fraction with zero denominator 
does not represent a number. (This is explained in Section 1-14.) 


=f =] 0 l 2 
cs - Ὁ τς ey es ee 1.1. 98 - 
3 1 I 7 
""“» "» 3 3 
Figure 1-2 


(d) Draw an axis and show the points corresponding to 4/3, —§/2, —10/3, 
1.25, —0.3. 


We now have a method for locating the unique point corresponding to 
any given rational number, but suppose instead that we are given a point. 
Can we always determine a corresponding rational number? Of course we 
can if the point lies at a whole number of unit lengths or fractional parts 
of the unit length from the origin. But is this so for all points? The answer 
is no. Some points (indeed, countless points) do not correspond to any 
rational number. Suppose we lay out from the origin a segment equal in 
length to the circumference of a circle of diameter 1. The number cor- 


* The exercises inserted in the body of the text are an essential part of the 
discussion and must be read and carried through (not necessarily in writing) if 
the material is to be understood. Answers are given at the ends of the sections. 
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responding to its endpoint is 7, the ratio of the circumference to the 
diameter of a circle. But 7 is not a rational number, although rational 
numbers such as 22/7, 3.14, and 3.1415926536 are used as approximations 
to it. 

Other examples can be obtained in the following way. Let x be the 
length of the diagonal of a square whose sides have length 1. We have 
by the Pythagorean theorem (see Fig. 1-3) that «ἢ = 12 - 1" = 2, and 
¢ = 4/2. We prove in Section 6-8 that there is no rational number equal 
to the square root of 2. Hence if we lay out the diagonal from the origin 
as in the figure, we determine a point that does not correspond to any 
rational number. 


0 1 /2 
Figure 1-3 


A number that corresponds to a point on an axis but is not a rational 
number is called an irrational number. For the time being, we shall always 
replace an irrational number by some rational number that is approxi- 
mately equal to it. Thus we replace \/2 by 1.414 even though (1.414)? = 
1.999396. Similarly we replace 7 by 3.142 even though neither this 
rational number nor any other is exactly equal to the ratio of the cir- 
cumference to the diameter. Table I in the Appendix gives rational 
approximations of the square and cube roots of positive integers up to 
100. 

Real numbers. The rational and irrational numbers are called real 
numbers. Every real number has a unique corresponding point on the axis, 
and every point corresponds to a unique real number. The whole axis 
is a picture of the real numbers, each point being the image of just one 
real number. Because of this correspondence, we visualize and talk about 
numbers as points whenever that is convenient. For example, we may 
speak of the point 6 (meaning actually the point corresponding to 6) or 
of the number 6, as convenient. | 

Decimals. When a real number is given in decimal form, the corre- 
sponding point may be located by rewriting the decimal as the quotient 
of two integers and using the method described above for rational numbers. 
For example, 3.142 = 3142/1000. However, it is easier to think of the 
decimal as the sum of units, tenths, hundredths, and so on. Thus 3.142 = 
3+ 1/10 + 4/100 + 2/1000. Then we locate the point by going to 3, 
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then 1/10 more, then 4/100 more, and finally 2/1000 more. If the decimal 
is negative, we do the same but in the negative direction. 


Sketch each of the following, drawing an axis for each, and using approxima- 
tions from Table I in the Appendix where necessary: (6) 2.1, (ὃ) 3.14, 
(g) 2277, (8) —1.5, (i) —3.85, (j) ν8, (k) W5. 


Imaginary numbers. Not all numbers are real. For example, there is no 
real number whose square is —2. Hence ./—2 is not a real number. The 
same remark applies to / —6, ν΄ —9, 1 + /—3, and so on. Nevertheless, 
such numbers can be defined so that they are as useful and meaningful 
as real numbers. They are called imaginary numbers, although they are 
no more imaginary in the usual sense than the reals. The real and im- 
aginary numbers are called complex numbers. For the time being we shall 
limit our discussion to real numbers. 


PROBLEMS 


1. Which of the following numbers are rational? real? irrational? 2, —3/2, 
V4, V5, ν ἃ — 1, V—4, 1.79, —22/7, 3.14. 

2. Answer the same questions for these numbers: —43.08, ν, 845, »/169, 
—85/9, V2 — V—3, 1, 0, —1, 1+ 2ν8. 
*3. Explain ‘and illustrate with an example the method of dividing a seg- 
ment into several equal parts by ruler and compass construction.* 


ANSWERS TO EXERCISEST 


(a) In order from left to right: —6, —2, 6,7. (Ὁ) equidistant. (ὁ) right, 
left, left, right. (d) In order from left to right: —10/3, —5/2, —0.3, 1.25, 
4/3. (6) Note that 22/7 is not the same as 3.14, though they are approxi- 
mately equal. 


ANSWERS TO PROBLEMS{ 


1, All but V5, V2 — 1, and —4 are rational. All but ~/—4 are real. 
V5 and 1/2 — 1 are irrational. 3. See a geometry textbook. 


1-Ὁ Constants and variables. Written communication is accomplished 
by means of various marks on paper—letters, punctuation marks, con- 
ventional signs, words, abbreviations, phrases, sentences, pictures, dia- 


* Problems marked with a star are optional. They are interesting, sometimes 
difficult, but not essential to the main argument of the book. 

Τ᾽ Answers are given to most exercises and to selected problems. Abbreviated 
answers are sometimes given, but the reader should give answers in full, of 
course. | 
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grams, and so on. We call any intentional mark a symbol. Examples are 
the heading of this section, the previous sentence, the word “two, ” the 
sign “+,” the numeral “18,” and the question mark. A symbol may con- 
sist of just a single mark, or it may be made up of several marks, each of 
which is a symbol. Examples are, respectively, “8” and “753.” We often 
call a symbol an expression, especially if it is made up of several symbols. 
“Symbol” and “expression” are synonymous. 

The significance of a symbol depends upon the way it is used. For 
example, the sound of “a” is different in “hat” and in “hate.” The context 
of a symbol is the collection of symbols among which it appears and the 
other circumstances of its use. The meaning of an expression depends 
upon its context. For example, “2” stands for 20 in the expression “327” 
and for 200 in “271.” 


(a) Compare the meaning of “The Constitution of the United States” in 
1859 and 1954. (Ὁ) Why does a word-for-word translation from one language 
to another seldom preserve meaning? 


Some symbols serve in particular contexts as names of specific things— 
persons, places, tangible objects, or ideas. Examples are “the Pacific 
Ocean,” “the Eiffel Tower,” “the English alphabet,” “the President of the 
United States,” and “New York City.” A symbol that in a particular con- 
text is the name of just one specific thing is called a constant. Grammarians 
call constants proper nouns. The thing that a constant names is called its 
value. A constant stands for, or names, its value. The value of a constant 
is its meaning. The symbol “F.D.R.” is a constant standing for the late 
Franklin D. Roosevelt. The man F.D.R. is the value of the symbol 
“F.D.R.” A symbol that stands for a number is called a numeral. For 
example, the numeral “2” stands for the number 2. We say also that “2” 
has the value 2. 

The distinction between a constant and its value is simply the dis- 
tinction between a name and the thing it names. A constant always names 
its value. If we wish to name a constant or other symbol, we do so by 
enclosing it in quotation marks (as illustrated in the above paragraphs*). 
The quotation marks are sometimes omitted when no confusion is likely. 
However, the distinction between a symbol and its value is important 
if language is to be understood. When we communicate we do so not 
by using the objects about which we are talking, but by using their names. 
For example, to talk about the number 7 we use “7,” “VII,” “seven, " 
or some other constant standing for it. 


* As the reader will note, it is customary to place the closing quotation mark 
outside a comma or period immediately following the expression, even though 
the comma or period is not part of the expression. 
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(c) Distinguish between “27” and 27. (d) Why is the Pacific Ocean not a 
constant? (6) Describe exactly what you see below: 


A Inve Cobra 


(f) Why is “a chair” not a constant? (g) The English alphabet consists of 
26 letters, yet we call “the English alphabet” a constant, i.e., the name of one 
specific thing. Why? 


Many symbols are not constants. Obvious examples are punctuation 
marks, which certainly do not name anything. But many nouns do not 
name one specific thing. For example, in “A chair has four legs,” the word 
“chair” does not name any particular chair. In mathematics when we 
wish to refer to an unspecified object of a certain kind, we usually use 
some arbitrary letter or other special symbol, such as “xz.” For example, 
the statement that any integer equals itself divided by one could be writ- 


{,,. 7) 


ten x/1 = x, where we use “x” to refer to an unspecified number. Here 
“ce” is not a numeral, but it stands for any number. By this we mean 
that any constant standing for a number (that is, any numeral) may be 
substituted for it. For example, we may write 2/1 = 2 or 3/1 = 8 or 
any other expression obtained by substituting a numeral for x. Similarly, 
in “A chair has four legs,” “a chair” stands for any chair in the sense that 
we may substitute for it the name of any particular chair. 

A symbol that, in a particular context, is not a constant but for which 
any one of certain constants may be substituted is called a variable. For 
example, “x” is a variable in the context x/1 = z, and “a chair” is a vari- 
able in the context “A chair has four legs.” The constants that may be 
substituted for a variable in a particular context are called significant sub- 
stetutes for it. The value of a significant substitute is called a value of the 
variable in a particular context. In our examples, “2” is a significant 
substitute for “x” and “the first chair in the first row” is a significant sub- 
stitute for “a chair.” The number 2 and the first chair in the first row are 
the corresponding values. 

Note that a constant has just one value, whereas a variable has more 
than one value. This means that a single object is associated with a con- 
stant, whereas a collection of several objects is associated with a variable. 
The variable may be thought of as standing for some unspecified object 
in this collection. 


Cite some values of the variables in: (8) “the point x and the point —z 
are equidistant from the origin”; (i) “Mr. KX”; (j) “the price of y”; 
(k) “the response to r.” 


A handy method of creating variables is to place an additional symbol 
(called a subscript) to the right and below another. Thus we may create 
an unlimited number of different variables from x by writing σι (read 


8 ELEMENTARY ALGEBRA [cHaP. 1 


“x-sub-one”), 12, 23, Lk Ln, and so on. Often the same letter with different 
subscripts is used to indicate different variables having the same values. 
Thus we might speak of the men M, and Me, the women W, and Wa, and 
the numbers 2; and 172. 

Most of the confusion concerning variables and constants is avoided if 
a careful distinction is made between symbols and their values. Since a 
variable is a placeholder for which constants are to be substituted, it ap- 
pears in its context in the same way as if it were a constant. But a variable 
is not the name of any particular thing and has no definite value. It is 
merely a symbol used to indicate a place in which a constant may be sub- 
stituted. It is a blank, or hole, to be filled in a certain way. Thus in 
“2 +2,” “x” appears in the same way as does “2” in “3 + 2,” but “x” 
is not the name of any particular number. 


PROBLEMS 
Describe the values of the variables in Problems 1 through 4. 


1. “242.” 

2. “A better tennis player than x.” 

3. “Thus, if the demand for commodity A increased relative to the demand 
for commodity B, this would cause more of A to be produced.” 

4. “Economic theory can tell us absolutely nothing more than that for the 
attainment of a given technical end z, y is the sole appropriate measure or is 
such together with y1 and yz respectively, and that their application and thus 
the attainment of the end x requires that the ‘subsidiary consequences’ Z, 21, 
and zo be taken into account.” (Max Weber, The Methodology of the Social 
Sciences, 1949, p. 37) 


5. Distinguish carefully between the meaning and role of “chair” in “Bring 
a chair” and “Bring that chair in the corner.” 

6. Does “Mr. X” name anyone? When “X” is replaced by a man’s name, 
what happens to “Mr. X”? Does it make sense to ask what happens to Mr. X? 

7. Is it true that the value of a constant never changes? 

8. Does a variable change? How can variables be used to indicate change? 

9. There is only one value of x for which 2x = 4. Does this mean that x 
is a constant in 2x = 4? 
*10. “In language, which is the most amazing symbolic system humanity 
has invented, separate words are assigned separately conceived items in ex- 
perience on a basis of simple one-to-one correlation.” (S. K. Langer, A Theory 
of Art, 1953, p. 30) Comment. 
#11. Write a brief essay on the different dictionary definitions of “variable.” 
*12. In a magazine advertisement we read, “Everything you need for baking a 
cake is right on this page.” Comment. 


ANSWERS TO EXERCISES 
(a) Amendments have been added. (Ὁ) Meaning depends on context. 
(c) The first is a numeral, the second a number. (d) It isa body of water, 
not a symbol. “The Pacific Ocean” is a constant. (6) Nota live cobra! (ἢ 
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It names no particular chair. (g) It names a single object, which is a collec- 
tion, or set, of letters. (h) 4, —4, —1.7, 1/3, —\V/7. ὦ Jones, Smith, etc. 
(j) Butter, sugar, etc. (Κ) A pin prick, a letter, ete. 


ANSWERS TO PROBLEMS 


1. Numbers. 2. People. 3. Commodities. 5. The first refers to any un- 
specified chair, the second to a particular chair. 6. No. It becomes a constant. 
7. It changes with the context. 8. No. By substituting different constants for 
them. 9. No. It has many values in the equation, even though only one of 
them yields a true statement. 


1-4 Vectors. Suppose an aircraft flies due east 75 miles. We may 
picture its flight by an arrow as in Fig. 1-4, where an axis has been chosen 
so that the beginning of the flight is at the point —10. If the aircraft had 
flown 35 miles due west, the arrow would point in the other direction, as 
in the figure. We have labeled this arrow —35 instead of 35 in order to 
indicate its direction. Clearly, any motion in the east-west line can be 
indicated by an arrow whose length is the distance moved and whose 
direction is the direction of motion. Corresponding to each such arrow 
there is a unique real number equal to the length of the arrow if it points 
to the right and the negative of this length if it points to the left. 

—35 
75 


SS . τ᾿’ ᾿ 


τοῦ —40 —30 —20—-10 0 10 20 30 40 50 60 70 


FIGURE 1-4 


Arrows such as those pictured in Fig. 1-4 are sometimes described as 
directed distances and called vectors. We see that every real number de- 
termines a unique vector and, conversely, that every horizontal vector 
determines a unique real number. 


Complete the following: (a) The vector corresponding to the real number 7 
is a vector of length_______ pointing to the ______. (b) The vector corre- 
sponding to —7 is of length ________ pointing to 6. . (c) The 
vector corresponding to the number z is the vector of length x pointing to the 
right if ¢ is ______, and is the vector of length —x pointing to the 
if x is : 

Wherever a vector is placed, we may think of it as the directed distance 
from its initial point to its terminal point (see Fig. 1-5). For example, 3 is a 
directed distance of three units toward the right and may be visualized 
as a vector of length three pointing toward the right. It is the directed 
distance from the origin to the point 3, the directed distance from the point 
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Initial point Terminal point 
.----..-... θὖθπτρΡΠΡΠρὃἕτΠΠ- τ τ τΟυΟῤἵ΄Ὸ---.----ο- 


Directed distance from 
initial point to terminal point 


FIGURE 1-5 


1 to the point 4, and the directed distance from any point to a point 3 
units to the right of it. On the other hand, —3 1s a directed distance of 
three toward the left and may be visualized as a vector of length three 
pointing toward the left. It is the directed distance from the origin to the 
point —3 and from any point to another point 3 units to its left (see 
Fig. 1-6). 


3 
«--.--.--.---- - ὕ.. 2 
««-------.... ὁ 3 . 8 


—§ -4 -—3 - --ἰ 0 1 2 3 4 ῦ 6 7 


FIGuRE 1-6 


From now on we shall think of numerals as standing for numbers, points 
on a real axis, or vectors, according to convenience. For example, we use 
“1” to stand for the number —1, the point —1 (the point one unit to 
the left of the origin), or the vector —1 (the vector of length 1 pointing to 
the left). 


Draw an axis for each of the following and the vector given by the number 
with its initial point at the origin and in two other positions: (d) 5, (6) —85, 
(ἢ —1.75, (Ὁ 14 V2. 

Complete the following. (h) If the vector z is placed with its initial point 
at the origin, its terminal point will be at the point 


Addition. Among the many uses of vectors is the vector interpretation 
of the operations of arithmetic. Suppose an aircraft flies 75 miles east, 
then 35 miles more in the same direction. Its total flight is given by adding 
the numbers or by adding the vectors, as in Fig. 1-7. The addition, 
75 + 35 = 110, and its vector picture can be given many interpretations. 
For example, if two forces of 75 and 35 pounds are pulling in the same direc- 
tion, their resultant is the force of 110 pounds. 

We define the sum of two vectors as follows: To find the vector a + ᾧ, 


FIGURE 1-7 
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lay the vector ὃ with its initial point on the terminal point of a. Then 
a + 6 is the vector from the initial point of α to the terminal point of ὃ 
(see Fig. 1-8). In this definition the values of the variables “a” and “b” 
are all real numbers, i.e., all vectors of the kind we are ea With 
this definition, vector addition corresponds exactly to numerical addition. 
For example, 7 + (—3) = 4 by the rules of arithmetic; and the same 
result is obtained by adding the vector 7 and —3 according to the rule 
(see Fig. 1-9). 


a+ ὃ 4 —3 
Sr — Pe ὁ 
a b 7 
—_—_e Ol —_—_—_—_—_—_———————————————_————_————— 
FIGURE 1-8 FiIGuRE 1-9 


In every case we may visualize a + b as the motion given by the vector 
a followed by the motion given by the vector b. For example, as illus- 
trated in Fig. 1-9, going 7 units to the right then 3 to the left is the same 
as going 4 to the right. This interpretation of addition in terms of motion 
is very old, as illustrated by the fact that the ancient Egyptian symbols 
for “plus” and “minus” were pairs of legs walking in different directions. 
It is worth while to cultivate the habit of visualizing addition in this way, 
since it gives a check on accuracy, a quick way of finding sums by “just 
looking,” and an important tool for dealing with distance. 


Draw vector diagrams for the following: (i) 3.2 + 4.5, (4 8-+ (—8), 
(k) —4+ (--2,Ὠ Ο —4.14 6. 

Complete the following: (m) If ais placed with its initial point at the origin, 
then the terminal point of the vector a + ὃ is at the point ὶ 


We now give vector interpretations of other arithmetical concepts. 

Negation. The expression “—a” stands for the number obtained from a 
by the operation of negation. If a is a positive number, say 17, then —a 
is the corresponding negative number, —17. If a is a negative number, 
say —7/, then —a is the corresponding positive number, 7. The points a 
and —a are at the same undirected distance from the origin, and the vec- 
tors a and —a have the same length. If α is a vector pointing toward the 
right, its length is just a. If ais a vector pointing toward the left, its length 
is —a. For example, the length of —7 is —(—7) or 7. We deapaate the 
length of the vector a by |a|, which is also called the absolute value of the 
number a. It is the undirected distance between the initial and terminal 
points of the vector. The reader should note that in this discussion both 
negative and positive numbers are values of a. Thus a may be positive or 
negative, so that —a may be either positive or negative. Indeed if a is 
negative, —a is positive. The length of a is therefore a if ais positive and —a 
1} a rs negative. 
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Sketch x and —z for each of the following values of x. Find |z| in each case. 
(n) δ, (ο) —5, (p) V2, (q) —10, (r) —7/.1. 


Subtraction. The difference a — b is the number that yields a when it 
is added to ὃ, that is, ὃ + (a — b) = a. For example, 7 — 5 = 2 be- 
cause 5 + 2 = 7. We may find the vector a — ὃ by placing the initial 
points of a and b together and then drawing the vector from the terminal 
point of ὃ to the terminal point of a, as suggested in Fig. 1-10. 


Sketch the following as in Fig. 1-10: (s)5—2, (ὃ 7— 10, (ὦ) 
—3 — 4, (v) —4 — (3). 


From the above examples it is evident that a — b is just the vector 
from the point ὃ to the point a, that is, the directed distance from ὃ to a. 
Hence we may always visualize a subtraction as the directed distance, or 
vector, extending from the subtracted point to the other point. The directed 
distance from one point to a second point is found by subtracting the first 
(beginning) point from the second. In Fig. 1-11 we use subscripts to em- 
phasize this idea. The absolute value of the difference gives the undirected 
distance between the points. | 


tg — & 
a een ὦ 
a ence ane caret oo: ao 
Ee ee ee ee 11 Χο 
FIGURE 1-10 Figure 1--11 


Multiplication. The product of a and ὃ, symbolized by “ab,” “a- 6,” or 
“a Χ ὃ," is the number obtained from the factors a and ὃ by multiplication. 
When a is a positive integer, a - ὃ is obtained by adding ὃ to itself a times. 
In terms of vectors, a - ὃ is found by laying ὃ out a times. If a is negative 
we interpret this to mean that ὃ is laid out a times in the negative direction 
or, what comes to the same thing, — is laid out a number of times equal 
to —a. (Note that —a is a positive integer here, since a is a negative 
integer.) This is illustrated in Fig. I-12. 

If a is not an integer but is rational, a-b is found by laying out ὃ a 
certain integral number of times and then laying out some fractional part 


(—4)-2 or 4(—2) 4.2 
————— 0. ΓΤ τ τς 
—2 —2 —2 —2 2 2 2 2 
no ans — ee Oe Oe 
~—10 —§ 0 ὃ 10 


FIGURE 1-12 
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Figure 1-13 


of it. Thus (2.7) - (1.5) is found by laying out 1.5 twice, then laying out 
7/10 of it (see Fig. 1-13). This seven-tenths can be found by arithmetic, 
since (7/10)(1.5) = (7)(1.5)/10 = (10.5)/10 = 1.05. Or it may be found 
by dividing the segment 1.5 into 10 equal parts and laying out 7 of them. 
If a is irrational, we shall replace it by a rational approximation. (A more 
satisfactory procedure is suggested in Section 6-9.) 


Sketch: (w) 2+(—7), (x) ./2- (2.9). 


Division. The expression “a + b” (or “a/b”) means the number that re- 
sults from dividing a by b. It is the number that yields a when it is multi- 
plied by ὃ. Thus 6/3 = 2 because 6 = 3-2. If bisa positive integer, we 
can find a/b by dividing a into b equal parts. In other cases, the geometric 
interpretation is not so simple, but in every case if the vector a/b is mul- 
tiplied by ὃ according to the vector interpretation of multiplication, the 
result is a. 


Sketch: (y) 5/3, (2) —18/5. 


PROBLEMS 


Sketch the following with vectors. 


1. —2.35 + 1.28. ὅς. 10. =(3.0), 3. (—24.1)(—1.9). 
4. (5.2)(—3.1). 5. = 7:1 -- 8.01, 6. —5/2. 


For the following draw vectors with initial and terminal points as given (in 
that order) and verify that the appropriate difference gives the vector. 


7. 3and 8. 8. 8 and 3. 9. —1 and 4. 
10. —3 and —9. 11. 1.1 and —1.1. 12. 7 and —1. 
13. —14.2 and 14.2. 


14. If the initial point of the vector x is placed at the point a, where will 
its terminal point be? 

15. What are the values of the variables in this section? 

16. What are the significant substitutes for them? 

17. Is “O” a significant substitute for “b” in the expression “a/b?” (Hint: 
What number would be represented by “a/0” for any particular value of “a”?) 

18. The expression (Deaths - 1000)/(Population) gives the death rate in 
deaths per thousand inhabitants. (a) What are the variables? (b) What are 
the constants? (c) Cite some values of the variables. (d) Calculate roughly 
the value of the expression corresponding to deaths of 1,340,000 in a population 
of 119,000,000 (figures for the U.S. in 1930). 
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ANSWERS TO EXERCISES 


(a) 7,right. (Ὁ) 7, left. (ὁ) Positive, left, negative. (6) Use approxima- 
tion from Table I in the Appendix. (h) zx (ὃ Fig. 1-14. (m) a+b. 
(0) Fig. 1-15. (p) |x/2| = νῷ. (q) [--10] = —(—10) = 10. (Ὁ |—7.1| = 
—(—7.1) = 711. (s) See Fig. 1-16. (2) See Fig. 1-17. 


--4.1 1.9 


Figure 1-14 


--ὃ 0 δ 
Figure 1-15 
i ee 
2 § —2 


Figure 1-16 


Figure 1-17 


ANSWERS TO PROBLEMS 


2. See Fig. 1-18. 7. Note that the initial point is to be subtracted; in this 
case 8 —3 = 5. 8.3 —8 = —5. 14.2 +a. 15. Numbers. 16. Numerals. 
17. No. See Section 1-14. 18. (a) “Deaths,” “Population.” (Ὁ) “1000.” 
(c) Say 25 and 1000 for “D” and “P.” (d) (1,340,000,000)/(119,000,000) = 
1340/119 = 11.3. (We use = to mean approximate equality.) 


Figure 1-18 
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NORTHFIELD, Minn. __——s1'9__Noo. 


THE FIRST NATIONAL BANK 75-147/912 
PAY TO 


THE: ORDER: ὃ ha ee τ--..ςς. 
Dollars 


Figure 1-19 


1-5 Formulas. Everyone has had some experience with forms in which 
blanks are to be filled in. For example, the form pictured in Fig. 1-19 
becomes a bank check when correct entries are made. Evidently the 
blanks in the form are variables, since constants are to be substituted 
for them. 

The check form is convenient for filling in, but it is easier to discuss if 
we write it in one of the following ways. 


Northfield, Minn. Month, Day 19year No. Number The First 
National Bank 75-147/912. Pay to the order of payee $ 
Amount (in a decimal) Amount (written out) Dollars. Payer 
(signed) 


(1) 


(2) Northfield, Minn. m, d 19y No. n The First National Bank 
75-147/912. Pay to the order of x $a, A Dollars. z. 


Form (1) makes clear the nature of the significant substitutes for the 
variables. Form (2) is more compact and easier to talk about. For ex- 
ample, we may say that the value of “a” must be the same as the value of 
“A” if the check is to be valid. 


(a) Describe the values of each variable in Form (2). (Ὁ) Why are letters 
more convenient variables than blanks? (c) What happens to the check if a 
is larger than the balance in z’s account? (4) Restate the question in (c) 
using words for variables. (6) What do we call “z” if it is written by someone 
other than z or an authorized agent? (f) Name some constants in the check 


form. 


Some expressions in the check form are neither constants nor variables. 
Among these we observe that certain expressions, such as 19__, become 
constants if appropriate constants are substituted for their variables. 
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For example, 19... becomes 1955 if 55 is substituted for the blank. An 
expression that contains at least one variable and that becomes a constant 
when significant substitutions are made for all its variables is called a 
formula. The entire check form is a formula, because when significant 
substitutions are made for its variables it becomes a valid check, i.e., a 
name of an order to pay. In the previous sections we have used such for- 
mulas as x, -ὦ, —a, a+b, a — 6, ab, and a/b. A single variable is a 
formula according to this definition. 

When significant substitutes displace the variables in a formula, the 
formula becomes a constant. The value of any such constant is called a 
value of the formula. A formula, like a variable, has no uniquely de- 
termined value. Rather, it has many values, each one determined by sub- 
stituting constants for the variables that appear in the formula. Significant 
substitutes for the variables in a formula are those that yield values of the 
formula, i.e., those that transform the formula into a constant. We say 
that formulas stand for their values and that they take certain values when 
constants are substituted for their variables. 


In each of the following formulas indicate the variables, some values of the 
variables, and the corresponding values of the formula. (g) “.-ὄ - 119. ἢ 
in Fig. 1-19. (h) “Mrs. A, this is Mr. B.” (i) “Area equals width times 
length.” (j) “Profit equals gross income minus costs.” (k) “a, 
i, ne arm, 
_ ig, Lis.” = (1) “TT ______ take thee _____._ to be my _ wedded 
wife.” (m) a — ὦ, —a, a/b. 


We call constants, variables, and formulas terms. Terms are symbols 
that have values. Not all symbols are terms. For example, punctuation 
marks and individual letters in words are not terms. A term is able to 
stand alone in the sense that it either names something (if it is a constant) 
or is a formula that names something when constants are substituted for 
its variable or variables. 

Whether or not a symbol is a term depends on the context. For ex- 
ample, “moi” is a term in French but not in English. Most of the time, in 
mathematical as well as in other writing, the reader is supposed to be able 
to tell from the context what the significant substitutes and values of the 
variables are. This is easy when words are used as variables, since they 
usually name their own values. When letters are used as variables, the 
values must be determined from the context, unless specific indications are 
given. In formal mathematical theory all possible confusion is avoided 
by spelling out exactly which symbols are terms and what substitutions 
are allowed. 


Which of the following are terms? Justify your answers. (n) “2a,” (0) “Mr. 
X+1,” (p) “ae = ,” (q) “ex --ἰἦς -- @+42)),” (Ὁ “a/ .” 
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We say that an expression is in the form of a certain formula if it can be 
obtained from the formula by some substitutions for the variables without 
any other operations. For example, “2?” is in the form of “a?” because it 
can be obtained from “a?” by substituting “2” for “a.” However, 4 is not 
in the form a”, because to get 4 from a? we must substitute 2 and then 
replace 2 by 4. We say that two expressions are in the same form if they 
are in the form of the same formula. For example, 27 and 3? are in the 
same form since they are both in the form a”. 


(s) Show that 6? and 4° are in the form a?. (0) Show that 22, x2, and ὃν 
are in the form x¥. (u) Why is 2/3 not in the form a/a? 


PROBLEMS 


1. Why is a variable a formula? 

2. Illustrate how the values of a variable may be determined from the con- 
text by discussing “Mrs. A” and “x — 3y.” 

3. When two trees have almost the same circumference, the following formula 
is sometimes used to determine which is bigger: the number of inches of circum- 
ference at 4.5 feet above the ground is added to the height in feet plus one-fourth 
of the crown-spread in feet. (a) State this formula in an equation, using words 
for variables. (b) Do the same, using letters. (c) In 1946 a white pine near 
Newald, Wisconsin, was reported to have a height of 140 feet, a crown-spread 
of 56 feet, and a circumference of 17 feet 2 inches at 4.5 feet above ground. 
Calculate the number of points by substituting in this formula. 

4, R. C. Angel devised the following formula for measuring the “welfare effort” 
of a city in a community chest campaign: effort = (amount raised) /(quota) + 
(pledgers)/(number of families in area) + (amount raised)/(0.0033 yearly 
retail sales). What is the significance of each term? In what numerical range 
would you expect the scores to lie? (Communities on the average contribute 
about 1/3 of 1% of retail sales.) 

5. The distance traveled by a body moving at constant speed is equal to its 
speed multiplied by the time elapsed. Rewrite this sentence, using letters for 
variables. 

*6. Show that an expression may be in the form of several different formulas. 
*7. Argue that the meaning attached in this section to “in the form of” is 
not inconsistent with its use in such expressions as “The church was constructed 
in the form of a cross” and “The sonata is in the form A, B, A.” 


ANSWERS TO EXERCISES 


(a) m: months, 12 values; d: days, 31 values; y: years, 100 values; n: natural 
numbers; z: people; a: amounts of money. (0) Difficult to distinguish different 
blanks. (ὁ) It bounces! (d) Replace a by “the amount” and z by “the 
payer.” (6) A forgery. (ἢ) “Northfield,” “Minn.,” “The First National 
Bank.” (4) For each substitution the formula becomes a name of a day, and 
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its value is that day. (h) A: women; B: men; formula: introductions. (1) 
Area: numbers; width: numbers; length: numbers; formula: statements of 
equality. (j) Like (i). (1) This formula is a paradigm in which the blank 
is to be filled in with the stem of a Latin noun. Values of the formula are declen- 
sions. (1) Blanks: men, women; formula: pledges. (τι) a, 6; values are 
numbers. (n) Yes. (0) No. (p) No. (4) Yes. (r) No. (s) ὃ for a 
and 4 for α. (t) 2 and 2, x and 2, and 5 and y for z and y, respectively. 
(u) Any substitution yields same numerator and denominator. 


ANSWERS TO PROBLEMS 


1. A variable is an expression containing at least one variable and becoming 
a constant when its variables (just itself) are displaced by appropriate con- 
stants. 3. (a) Size = circumference -++ height + (1/4)(crown-spread). (Ὁ) C+ 
H+ 8/4. (ec) 360. 


1-6 Equations. A constant has only one value in a given context, but 
different constants may have the same value. For example, “Minnesota, ” 
“the land of ten thousand lakes,” and “the thirty-first state” all have the 
same value. 

When two constants have the same value (are names of the same thing) 
we call them synonymous. When we wish to indicate that two constants 
are synonyms, we write an equals sign between them. By 1/2 = 4/8 
(1/2 equals 4/8) we mean that “1/2” and “4/8” are names of the same 
thing; that is, 1/2 and 4/8 are one and the same thing. 


Cite several constants synonymous with (a) “New York State,” (b) “zero,” 
(c) “.ἌἝΨᾳὌΦΑΔΕο. 1 é 7 


We call an expression of the form « = y an equation, in which z and y 
are the sides or members. The statement x = y is true if and only if x 
is ddentical with y. The relation of equality, represented by the equals 
sign, is the relation that a thing has with itself and with nothing else 
We write x ~ y for “zx is not equal to y.” 


(d) Restate a = bin words in as many ways as youcan. (e) Whyis2 = II 
true even though “2” and “II” are obviously different? (ἢ) If a ~ ὃ, what 
do we know about a and b? Κ(ρ) One of the axioms of Euclid was, “If two 
things are equal to the same thing, they are equal to each other.” Restate this 
so as to avoid any confusion as to the meaning of equality. (8) Write ten 
expressions having the value 10. 


Sometimes “equals” is used loosely to mean merely that some aspect 
of two things is the same. Thus it is customary in high-school geometry 
to say that two triangles are equal when their areas are the same. Accord- 
ing to the meaning we have given “equals,” AABC = ADEF if and only 
if the two symbols (“A ABC” and “A DEF”) stand for the same triangle. 


1--6] EQUATIONS 19 


which can be so only if “D,” “HZ,” and “F” are just different names for the — 
vertices of A ABC. Hence if we wish to indicate that two triangles have 
the same area, we write (Area A ABC) = (Area ADEF), which is true 
when each side of this equation stands for the same number. Similarly, in 
The Declaration of Independence, “all men are created equal” means that 
the rights of all men are (or should be) the same. This could be stated 
more precisely (though less effectively from a literary point of view) by 
writing “all men should have equal rights. ” 

We can always avoid using “equals” loosely by indicating the particular 
aspect of the two things that is actually identical, as we have done in the 
previous paragraph, or else by adopting a special symbol. For example, in 
geometry we write A ABC ~ A DEF to mean “(The shape of AABC) = 
(The shape of A DEF),” and write x & y to mean that x and y have the 
same shape and size. In this book, the equals sign always means identity, 
and τί 18 not used at all in the loose sense. 


(i) What aspect of the two lines a and ὃ is the same when a is parallel to b? 
(j) What precisely is meant in plane geometry when it is said that two seg- 
ments are equal? (k) Can two different numbers be equal? How then can 
we say that 2 = 10/5? 


An equation involving no variables is either true or false. For example, 
1.5 = 3/2 is true, and 1/2 + 1/3 = 1/5is false. However, if an equation 
involves variables, we cannot say that it is true or false until constants are 
substituted for the variables. For example, 2x = 6 1s neither true nor 
false. If we substitute “3” for “x” we get the true equation 2-3 = 6, 
and if we substitute “4” we get the false equation 2-4 = 6. Equations 
that are true for some values of their variables and false for others are 
called conditional equations. Equations that are true for all values of their 
variables, such as x + x = 27, are called identities. 


Which of the following are conditional equations and which identities? 
(l)a+b=b6b+a, (m)1—2r = 3, (n) α = a-a. 


A value of a variable in a conditional equation that makes the equation 
true is called a root. For example, 3 is a root of 2x = 6. 


Show that the following are roots of the indicated equations: (o) 2 of 
6 = 3x, (p) —30fz7 = 9, (q) —40f7 +2 = 3, (2) lof(« — 1)z =0, 
(s) 2o0f 1 — 85) = 22 — 9. 


The roots of an equation whose members stand for numbers are not 
altered by adding the same term to or subtracting the same term from both 
members, or by multiplying or dividing both members by the same non- 
zero term. Hence in solving equations we may add, subtract, multiply, or 
divide provided we perform the same operation on both members and take 
care not to multiply or divide by zero. (The justification for this is given 
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in Sections 2-6 and 2-8.) For example, to solve 6 = 3x we may divide 
both members by 3 to get 2 = 7. 


Solve the following by using the ideas in the previous paragraph. 
(0) Σ -- [ὃ = 2, (u) ὅτ — 10 = 0. 


The members of an identity may be constants or formulas. In either 
case we call them synonymous and we say that either is a synonym of the 
other. This extends the use of “synonymous,” which was defined at the 
beginning of this section with reference only to constants. Synonymous 
expressions always have the same values. This means that if they are 
formulas, any significant substitution of constants yields constants with 
the same value. 

PROBLEMS 


Which of the following are identities? (Suggestion: Try values of the vari- 
ables.) Solve those that are conditional, if you can. 


1. l/a-+ 1/6 = 1/(a+ δ). Ὁ, ee ae 

3.a—1 = 2a — (a+ 1). 4. 20 -᾿ ς = 2(b+ ὦ). 

5. ab = ba. 6. α(ὖ -Ἐ 6) = ab+ ας. 

7. 81 = 12. 8.22 -- 1 Ξ 5: - 4. 

9. 152+ 8.1 = 0.42 -- 8. 10. 2.82 — 10.9 = 3.4 — 1.92. 


11. (« — 1)? = 9. 
*12. Find two values of x for which x? +- 1/z? = a? + 1/a?. 

13. Show that —3 is a root of 2? — 37x = 84. 
*14. A certain company advertises that its coupons may be redeemed as 
follows: a teaspoon for 5 cents and 34 coupons or for 20 cents and 2; a solid- 
handle knife for 5 cents and 89 or 50 cents and 3; a hollow-handle knife for 5 
cents and 129 or 85 cents and 3; a tablespoon for 5 cents and 69 or 40 cents and 
3; a butter knife for 5 cents and 69 or 40 cents and 3, a cold-meat fork for 5 
cents and 109 or 75 cents and 7. (a) Do the coupons have the same value for 
each premium? (b) What amount do you think was used by the company 
as the value of a coupon? (c) Using this value, which alternative is cheaper? 
(The first method in each case is called “The Thrift Plan.”) (Suggestion: Let 
z be the value of a coupon. Then the two ways of getting a teaspoon cost 
5 + 845 and 20 -+ 2x. Equating them, we get an estimate of z.) 
%* 15. According to Wittgenstein (Tractatus Logtco-Philosophicus, Ὁ. 139), 
“ |. . to say of two things that they are identical is nonsense, and to say of one 
thing that it is identical with itself is to say nothing.” Comment on this in 
relation to the preceding discussion of equality. 


ANSWERS TO EXXERCISES 


(a) “The Empire State,” “the state whose capital is Albany,” etc. (Ὁ) 0, 
2— 2,0/5,ete. (c) 5 — 6,13 — 2.3, --δ΄- 7. (d) “a” has the same value 
as “b,” a is the same as ὃ, etc. (6) The constants are different, but they have 
the same value. (f) a and ὃ are different in some respect. (g) If two symbols 
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are each synonymous with a third, they are synonymous with each other. 
(ἢ) 2-5, 1-10, X, 20/2,6-++ 4,11 — 1, ten, 15 — 5,9-+ 1,10!. (i) Direc- 
tion or orientation. (7) That their lengths are the same. (k) No, two dif- 
ferent things cannot be the same. Although “2” and “10/5” are different con- 
stants, they have the same value. (1) Identity. (m) Conditional. (mn) Iden- 
tity. (0) through (s) By substituting and simplifying. (Ὁ) 5. (u) 2. 


ANSWERS TO PROBLEMS 


3, 5, and 6 are identities. 7. 4. 9. Adding —0.4z and —3.1 to both sides, we 


get 1.10. = —11.1. Dividing both sides by 1.1, we get x = —11.1/1.1 = 
—111/11. 10. 148/42. 11.2 -- 1 = 3 or x—1 = —3. Hence x = 4 or 
x = --ῶὥ. 14. (a) They appear to have taken 4 cent as the value of the coupon. 


(0) The first method is cheaper. 


1-7 Sentences. The reader undoubtedly identifies 
(1) “2/3 is the reciprocal of 3/2” 


as a complete declaratory sentence. It is a constant because it expresses 

(is a name of) an idea. It makes sense to say that (1) is true or to say that 

(1) is false. We say that it is a statement whose value is a proposition. 
On the other hand, consider 


(2) “7/3 is the reciprocal of 3/2.” 


This is not a statement, although it does have the same form as (1). 
It makes no sense to say that it is true or to say that it is false. However, 
(2) does become a statement when we substitute properly for x. Evidently 
(2) is a formula whose values are propositions. 


(a) What values of x make (2) true? 


An idea that can be characterized as either true or false (but not both) 
we call a proposition. A constant whose value is a proposition is called a 
statement. A formula whose values are propositions we call a propositional 
formula. We call both statements and propositional formulas sentences. 
Sentences are simply terms whose values are propositions. 

In formal mathematical theories it is customary to indicate explicitly 
what terms are sentences and how truth is to be established. When such 
rules are not stated in connection with a discussion, a good common- 
sense way of deciding whether an expression is a statement is to ask “Does 
it make sense (whether or not it is correct) to call the alleged statement 
true?” To test an alleged propositional formula one sees whether state- 
ments can be obtained from it by substitution. 
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Which of the following are sentences? statements? propositional formulas? 
(b)2=4, ()r-—y=—(y—2), (d)t=—z, (6) ab+e = 2. 

(f) Are all equations sentences? 

A sentence that involves no variables must be a statement, and 15 either 
true or false. A sentence involving one or more variables is neither true 
nor false, since it is a propositional formula. But some substitutions may 
yield true statements, and others may yield false statements. Values of 
the variables that yield true statements are called solutions. Solutions are 
said to satisfy the sentence. As examples, 2 is a solution of “x + 2 = 4” 
and Herbert Hoover is a solution of “x was a mining engineer and x was 
President of the United States.” 

Finding all the solutions of a sentence is called solving the sentence. 
Variables whose values are sought are often called unknowns, and other 
variables that may appear in the same sentence are called parameters. 
For example, if we wish to solve the sentence “x + a = 3” for 2, x 1s the 
unknown and a is a parameter. Similarly, if we wish to solve the sentence 
“y is a son of x, and x is a male” for 2, x is the unknown and y 15 a param- 
eter. The solution is evidently (the father of y). This can be tested by 
substituting “the father of y” for x to get “y is a son of the father of y, and 
the father of y is a male,” which is certainly true for any appropriate 
value of y. 

The oldest, and quite legitimate, way of finding solutions is to guess and 
experiment. Use this method to find a solution to each of the following. Check 
your guesses. (g) 82 = 12. (h)e—4=3. 1 Ξ 4. (j) Dwight 
Eisenhower followed x in office. (k) 22 is the largest integer less than 5. 

In the previous section we discussed sentences of the form x = y. 
Many mathematical ideas are expressed in such sentences, but by no means 
all of them. As a simple example, when the numbers x and y are unequal, 
one of them is always smaller than the other. When the point z lies to the 
left of the point y on the real axis, we say that x 18 less than y. Symbolically 
we express this idea by writing x < y (ὦ is less than y) or y > ἃ (y ts 
greater than x). Since all negative numbers are less than zero, and all 
positive numbers greater than zero, we use “x < 0” and “x > 0” as ab- 
breviations for “x is negative” and “z is positive,” respectively. We refer to 
a sentence of the form x < y ory > «as an tnequality and to x and y as 
the members. | 

For what values of ris (1) « < 0 false, (m) x > Ofalse, (n) —z < 
false, (0) —x > 0 true? 


The solutions of an inequality often are infinite in number. For ex- 
ample, 2a < 12 15 satisfied by any number less than 6. The solutions make 
up all the axis to the left of 6, as indicated in Fig. 1-20. 


Sketch the solutions of the following sentences: (p)z<-—l, (q) 
22 > 12, (Ὁ) 82 « —9. 


1-7] SENTENCES 23 


2% « 12 


qr 


--ὃ 0 
FIGURE 1-20 


In solving inequalities, the same rules apply as those given for equations 
in Section 1-6 (addition of, subtraction of, multiplication by, division by 
the same number) with one very important exception. When both members 
of an inequality are multiplied by a negative number, > must be changed 
to < or < to >, ie., the direction of the inequality must be changed. For 
example, 2 < 4, but (—3)2 > (—3)4, since —6 > —12. To solve 
—3x < 18, we divide both members by —3 to get x > —6. (For a 
proof, see Section 3-5.) 


Solve the following sentences and sketch the solutions: (5) —38z > 18, 
(t) 22 << —4, (uy) --ἡ < 2, (v) 3842 < 22 — Ἰ. 


PROBLEMS 


Solve the following and sketch the solutions: 


1. 82 < #41. 2.1 — 2x > 4¢-+ 381. 
3. 8.25a « 5xa-+ ἃ. 4. —x > 2χ — 14. 
5. 24+ 3 — 1.be < 1.724 8.9. 6. —V2e+ V2 « - 1.- 1. 


The following intentional nonsense was written by the mathematician Charles 
Dodgson (Lewis Carroll), whose WEES contain many delightful illustrations 
of mathematical ideas. 


And as in uffish thought he stood, 

The Jabberwock, with eyes of flame, 
Came whiffling through the tulgey wood, 
And burbled as it came! 


(3) 


As Alice said, “Somehow, it seems to fill my head with ideas—only I don’t know 
exactly what they are!” 


%7. Replace nonsense words in (3) by variables, and decide whether the 
result is a sentence. 
*8. Make significant substitutions for the variables so as to form a statement. 
*9. Is (3) a sentence from Alice’s point of view? 
*%10. Is (3) a sentence in the context of the Jabberwock’s fictional world? 


Solve Problems 11 through 14 for z, indicating any parameters. 


11. A is the father of the father of z. 

12. Ais ason of a son of zx. 

13. 2 —a = 2. 

14. X served more than three terms as President of the United States. 
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%* 15. C. C. Fries in his Structure of English (Chapter II) refers to over one 
hundred definitions of “sentence.” The word most often means any unit of 
discourse, and sentences are classified as declaratory, exclamatory, interrogatory, 
or imperative. To which of these kinds does “sentence” as used in this book 
refer? Give examples of each kind. 


ANSWERS TO EXERCISES 


(a) 2 is the only one. (0), (ὁ), and (d) are sentences. Note that a false 
statement is stilla sentence. (Ὁ) isastatement. (06) and (d) are propositional 
formulas. (f) Yes, if their members are terms. (4) Anything 15. ἃ solution. 
(k) 2. (1) Positive or zero values. (τ) Negative or zero values. (n) Nega- 
tive or zero values. (0) Positive or zero values. (p) All points to the left 
of —1. (q) All points to the right of 6. (Ὁ) All points to the left of —3. 
(s) All points to the left of —6. (0) All points to the left of —2. (u) All 
points to the right of —2. (v) All points to the right of 4. 


ANSWERS TO PROBLEMS 


l2z<05. 2. « —5. 3.2 < 12/18. 4.4 < 14/3. 5.2 > —59/12. 
7. And as in x thought he stood, the y, with eyes of flame, came z-ing through 
the w wood, and burbled as it came. Yes. 8. troubled, chimpanzee, rush-, 
tangled. 9. No. 10. Yes. 11. 2 is a grandchild of A. 12. xz is a grandparent of 
A. 13.2 τ 4(ᾳ-- 2. 14. F.D.R. 15. Declaratory. 


1-8 Laws. Sentences may be true for some values of their variables 
and not for others, as we have seen in the last two sections. Some sen- 
tences are false for all values of their variables. For example, x/x = 2 is 
never true. Still other sentences are true for all significant values of their 
variables. An obvious example is 7 = 7z. 

A sentence all of whose values are true we call a law. A true statement 
is a law, since its one value is a true proposition. A propositional formula 
is a law if and only if every one of its values is a true proposition. This 
means that any significant substitution for its variables yields a true state- 
ment. A law has the convenient property that it can be applied to any 
situation in which the things under discussion are values of its variables. 


(a) What do we call an equation that is a law? (Ὁ) In order to show that 
a sentence is not a law, it is sufficient to cite one false value. Why? (c) Show 
that 2x = x2 is not alaw. (d) Show that 2x ~ 2? is notalaw. (ὁ) What 
do we mean by applying a law? (f) Why would it be incorrect to say that a 
law is a sentence that becomes true no matter what is substituted for its 
variables? 


The main goal of scientific activity is the formulation of useful laws. 
_ For example, Boring writes, “The whole value of science is that 1t reduces 
the complexities of the world to general rules, which, once established, 
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enable you to explain or predict many, many individual cases.” (Founda- 
tions of Psychology, p. 14) 


Discuss the following, restating them if possible so as to bring out the 
nature of their variables and their status as laws. (g) “Any two bodies are 
attracted to each other by a force proportional to the product of their masses 
and inversely proportional to the square of the distance between them.” (New- 
ton’s Law of Universal Gravitation) (h) Every change in nature is produced 
by some cause. (The so-called law of causality) (i) In every given state of 
the arts an increase of labor or capital beyond a certain point, as in the cultiva- 
tion of land, causes a less than proportionate increase in the production from 
the unit to which the additional labor or capital is applied. (The law of dimin- 
ishing returns) (j) “The following formula may be postulated. Laughter 
occurs when a total situation causes surprise, shock, or alarm, and at the same 
time induces an antagonistic attitude of playfulness or indifference.” (John 
M. Willmann, “An analysis of humor and laughter,” American Journal of 
Psychology, 1953, p. 70) (Κ) “No peoples seem to lack magic formulas and 
magicians ....” (C. Grant Loomis, White Magic, 1948, p. 5) (1) “... to ask 
for the cause of an event is always to ask for a general law which applies to the 
particular event.” (R. B. Braithwaite, Scientific Explanation, 1953, p. 2) 
(mm) The negative of the negative of a number is the number itself. 


Whether or not a sentence is a law depends upon the context. In the 
elementary algebra of numbers, “ab = ba” is a law. However, if we think 
of the variables as standing for acts instead of for numbers and think of 
ab as the act consisting of doing a and then doing b, “ab = ba” is not a 
law. For example, let a = putting on one’s shoes, and ὃ — putting on 
one’s socks. The act of putting on one’s shoes and then one’s socks is not 
the same as the act of putting on one’s socks and then one’s shoes. This 
illustrates the fact that a law is a sentence that is true in a particular con- 
text and for particular substitutes for its variables, namely the significant 
substitutes for the variables in that context. 

A sentence involving variables is a propositional formula, not a state- 
ment. Hence.a law involving variables is not a statement; it is a proposi- 
tional formula that has only true values. For example, “ab = ba” is not a 
statement. However, “αὖ — ba’ is a law” is a statement because it makes 
an assertion about the sentence “ab = ba.” To avoid having to make awk- 
ward statements such as “αὖ = ba’ is a law,” it is customary in mathe- 
matics to write laws as though they were statements. When this is done, 
the quotation marks and the phrase “is a law” are implicit. For example, 
when we write “ab = ba” we mean to claim that this sentence is a law. 
In the same way, when we say “A man has a soul,” we mean that “zx has 
a soul” is true for all human values of z. 

Since sentences that are not laws often appear in mathematical writing, 
the above convention might lead to confusion. However, the danger is 
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minimized by the convention that mathematical laws are displayed and 
labeled explicitly in some way. When this convention is not followed, the 
meaning has to be guessed from the context or by inspection of the sen- 
tence itself. For example, “Prove p” or “Show p” suggests that p is a law. 
On the other hand, “Solve p” suggests that p is not a law (though it may 
be), and “Consider p” leaves the matter open. In this book, a sentence 
displayed with a number in bold-face type is a law. 

For each of the following, specify significant values of the variables and 
decide whether the sentence isa law: (n) 2? = σα. (0) If zis taller than 
y, and y is taller than z, then z is taller than 2. (p)2=1-+71. (q) zx 
is congruent to y, then x is similar to y. 

We have seen that “p is a law” is a statement whenever p 1s a sentence, 
even if p is a propositional formula. We can always convert a proposi- 
tional formula into a statement by making a statement about it. When- 
ever p is a sentence, “p is a law,” “p is not a law,” “p has at least one true 
value,” “p is always false,” etc., are statements. The operation of making 
a statement about the truth of the values of a propositional formula is 
called quantification. We discuss it further with the aid of special symbols 
in Section 2-10. . 

Is each of the following true for all, some, or none of the significant values 
of its variables? (r)z=1. (8) 3: =2+2. (ὃ) (a+b)? = a? + δ. 
(u) If p is a law, then p has at least one true value. 


PROBLEMS 


1. Give an example of a law, indicating the variables and their values. 


In Problems 2 through 8, the values of the variables are numbers. Which 
are laws? 


2xn<27+1. . 3. a1 + δὴ) = a+ ab. 
4. |x| > 0. 5. Va+tb = Va+ Vb. 
6. 1/a+ 1/b = (a+ b)/ab. 7. ca/cb = a/b. 


8.at+t(btec) = (at+bd+e. 
In Problems 9 and 10, what are the values of the variables? For these values, 
is the sentence a law? 


9. If x is parallel to y, then y is parallel to z. 
10. If x is parallel to y, and y is parallel to z, then z is parallel to z. 


#11. To what extent does our use of “law” correspond with its use to indicate a 
statute, i.e., a law in the legal sense? | 


ANSWERS TO EXERCISES 


(a) An identity. (Ὁ) Since a law has only true values. (c) Let x = 5. 
(4) Letx = 20rz = 0. (e) Substituting for its variables. (f) It is alleged 
to be a true statement only for significant values of its variables. For some 
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substitutes it may have no meaning. (g) “Any two bodies” is the key ex- 
pression here. The law could be stated “If z and y are bodies, then z and y 
are attracted....” The sentence is claimed to be true whenever x and y are 
displaced by the names of bodies. (h) “If xis a change in nature, then x has a 
cause.” (i) The variables are “stage of the arts,” “increase of labor or capital,” 
and “increase in production.” (1) Variable is “a total situation.” (Κ) “If 
2 is a people, x seems to have magic formulas....” (1) Variable is “an event.” 
(m) Variable is “a number.” (n) Numbers. Law. (0) Persons, buildings, 
or other things that have heights. Law. (p) No variables. Law. (q) Geo- 
metrical figures. Law. (1) Some (one). (s) Some. (t) Some. (u) All. 


ANSWERS TO PROBLEMS 
2, 3, 6, 7, and 8 are laws. 4 is not a law since it is false for z = 0. 5 is nota 


law since it is false fora = ὃ = 1. In 7, “O” is not a significant substitute for 


“c,” so 7 is a law even though it is meaningless for ὁ = 0. See the answer to 


Exercise (f). 9. Lines; a law. 10. Lines; not a law since it is false when x = Ζ. 


1-9 Simple arithmetic identities. The simple addition 


37 
(1) τὰ 
82 


is done by following rules about adding columns and carrying, with which 


the reader has been familiar for many years. But why does the procedure 
give the right answer? To see the situation more clearly, we write 


(2) 37 + 45 = (80+ 7) + (40 - δ). 
Now in (1) we began by adding the 7 and 5; that is, 


(3) (30 + 7) + (40 + 5) = (30 + 40) + (7+ 5) 
(4) — (30 + 40) + 12. 


Then we carried; that is, 


(5) (30 + 40) + 12 = (30 + 40) + (10 + 2) 
(6) = (30 + 40 + 10) + 2. 
Finally, we added the tens to get 80 + 2 or 82. Such was our procedure, 


although considerably abbreviated in (1). Why are all these rearrange- 
ments of the terms justified? 
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Let us look at (3), for example. From the left to the right side we have 
interchanged the 40 and the 7. The shift can be justified by using the fol- 
lowing laws: 


(7) atb=b-+a (Commutative law of addition), 
(8) a+(b+c) = (a+b) τ ς (Associative law of addition). 


We apply (8) first by substituting in it (80 + 7) for a, 40 for ὃ, and 5 for 
c. We get 


(9) (30 + 7) + (40 + 5) = [(80 + 7) + 40] + 5. 


This begins the process of getting the 40 and 30 together and the 7 and 5 
together. We continue as follows, where each right member 15 synonymous 
with the right member above it, and so with the left member of (9) and (3). 


(10) = (G0 +7) +40] +5 = [80 -Ἐ (74 40)] +5 
(11) = [30+ (40+ 7)] +5 
(12) = [(30 + 40) + 7] +5 
(13) == (30 + 40) + (7 + 5). 


The steps are justified as follows: step (10) by substituting 30 for a, 7 for 
b, and 40 for c in (8); step (11) by substituting 7 for a and 40 for ὃ in (7); 
step (12) by substituting 30 for a, 40 for ὃ, and 7 for ὁ in (8); and step (13) 
by substituting (30 + 40) for a, 7 for ὃ, and 5 for ς in (8). 

The above steps are tedious, and we should not wish to go through 
them when doing additions. However, they show how the commutative 
and associative laws underlie even the simplest arithmetic. They in- 
dicate that the procedures in (1) may be justified by reference to simple 
laws that can be applied repeatedly to rearrange terms in a sum according 
to convenience. 


(a) Actually write out the results of making the substitutions indicated 
above in justifying steps (10) through (13). 


We are inclined to think of (7) and (8) as obviously true, because our 
experience with numbers has made us acquainted with many instances in 
which they apply. In terms of the geometric interpretation of addition 
given in Section 1-4, we know by experience that to go a directed distance 
a and then ὃ puts us at the same point as going a directed distance ὃ and 
then a (see Fig. 1-21). This lends plausibility to (7). 


(b) Give a similar argument for (8). (0) Illustrate (7) and (8) by sub- 
stituting numerals for the variables and evaluating the two sides. 
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a 
{ὦ | 


a a ΘΟ. eee a b 
gs ee te ab = ba 
Figure 1-21 FIGURE 1-22 


Two laws that for multiplication play a role similar to that played by 
(7) and (8) for addition are 


(14) ab = ba (Commutative law of multiplication), 
(15) a(be) = (ab)c (Associative law of multiplication). 


The vector interpretation of multiplication does not by any means make 
(14) evident. But (14) seems plausible if we recall that ab may be inter- 
preted also as the area of a rectangle of base a and height b, as indicated 
in Fig. 1-22. But then ba is the area of a rectangle of base b and height a. 
The obvious equality of the two areas suggests (14). 


(d) Give a similar argument for (15) by considering volumes. (6) Illustrate 
them for particular numerical values of their variables. 


The arguments above are arguments for the plausibility of (7), (8), (14), 
and (15). They certainly suggest their truth, but they do not prove them 
beyond any doubt, because they depend on diagrams and interpretations 
that may or may not always hold. (For example, what about negative 
numbers?) In elementary algebra (7), (8), (14), and (15) are usually 
assumed without proof. We shall discuss this matter and the nature of 
proof later. 

᾿ We list below some other laws used in simple calculations of arithmetic 
and elementary algebra. They hold for all numerical values of their 
variables, except that no denominator can be zero. (We explain the reason 
for this in Section 1-14.) 


(16) b+ (a — ὃ) = a. 


This identity expresses the idea that a — ὃ is the number that yields a 
when it is added to ὃ (see Fig. 1-10). 


(17) b- (a/b) = a. 
(f) What idea does (17) express? 
(18) a—b=a-4  (—), 


(19) a/b = a: (1/b). 
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a 
LS 


FIGURE 1-238 


Fill in the following: (g) (18) expresses the rule “to subtract, 
Δ΄ .," (h) (19) expresses the rule “to divide, _______ and _______..” 
(i) Explain Fig. 1-23, describe a method of doing vector subtraction based 
upon it, and compare it with that given in Section 1-4. (j) Suggest an alterna- 
tive method, based on (19), for finding a/b in vector terms. 

(20) a—a=Q, 


(21) aja = 1. 


(k) Express (20) and (21) in words. (1) Draw a vector diagram for (20). 


(22) a-+0O= a4, 
(23) O+a= a, 
(24) l-a=a, 
(25) a-l=a 


(m) How would you describe the vector corresponding to 0 in (22) and (23)? 
(n) Why do (22) and (23) seem plausible in terms of vectors? (0) Why do 
(24) and (25) seem plausible in terms of vectors? 


(26) a+ (—a) = 
(27) a+(1/a) = 
(28) a—O0O= a4, 
(29) a/l = a, 
(30) a-Q = 0, 
(31) 0-a = 0, 
(32) 0/a = 0. 


The following identities are used for dealing with expressions involving 
minus signs. 


(33) —(—a) = 4, 
(34) (—l)a = —a, 
(35) (—a)b = —(ab), 
(36) a(—b) = —(ab), 


(37) (—a)(—b) = ab. 
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(p) Substitute numerals for the variables in (16) through (37), calculating 
each side separately and verifying that the two sides are equal. Do this for 
several different substitutions, for negative and nonintegral numbers as well 
as for natural numbers. Note that these examples show that the laws hold in 
some cases, buf examples cannot prove them conclusively, because we might Just 
have happened to choose solutions of the equations. 


What is the importance of identities of the kind written above? Simply 
that they can be applied in order to simplify expressions or to rewrite them 
in some desired way. For example, if we see the expression 5 + (9 — 5), 
we know immediately that it has the same value as 9, because by sub- 
stituting 9 and 5 for a and b in (16), we get 5 + (9 — 5) = 9. Hence we 
may write the equation in this way, and we may also replace 5 + (9 — 5) 
by 9 or 9 by 5 + (9 — 5) whenever we wish, since they stand for exactly 
the same thing. 

Because of (7), (8), (14), and (15) we can rearrange the terms in a sum 
or the factors in a product in any order and in any grouping. When an 
expression is written in a different form by using these commutative and 
associative laws, we call the process rearrangement. When laws are used 
to effect a reduction in size or complexity of an expression we speak of 
simplifying. 


Simplify the following, indicating the identity used: (q) 2/1, (r) 8: (1/2), 
(s) —27/(—27), (ὦ) 0/100, (ἃ) (5000)(2/2), (v) 0-(—10), (w) 
(—2)(—5), (x) 14+ (—14),  (y) (—1)(45), (4) 1000 ’ (85/1000). 


PROBLEMS 


1. Justify the synonymity of the right members of (5) and (6) by reference 
to (8). 

2. Which is easier, 2° 5-37 or 2.87. δ᾽ What laws justify doing the easier 
calculation? 

3. What laws justify writing 2(—3)(—2) = (2)(—2)(—3)? 

4, Which is easier to evaluate, 84 - (16 — 59) or (84 - 16) — 59? What 
law assures the equality of the two? 

5. Rearrange the following so as to facilitate calculation: (a) 12 + 115 + 88, 
(b) 5+ 37+ 2, (ec) 150 — 60 - 150, (d) —5-+ 73 + 105. 

6. Justify (21) in terms of the vector interpretation of division. 

7. Argue for (26) and (27) in vector terms. 

8. Argue for (28) through (32) in vector terms. 

9. Sometimes in doing calculations one “cancels” two terms. What law or 
laws justifies this? What remains after cancellation? 
*10. When we do additions mentally, it is easier to rearrange in a way differ- 
ent from (1). Thus, we may think 37 + 45 = 37+ 40+ 5 = 77+ 5 = 82. 
Describe this procedure in vector terms and draw a diagram. 
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ANSWERS TO EXERCISES 


(a) (10): 30+ (7+ 40) = (80+ 7) + 40; (11): 7+ 40 = 40+ 7; (12): 
30 + (40 + 7) = (30+ 40) + 7; (13): [(80 + 40) + 7] + 5 = (80+ 40) + 
(7 -᾿ 5). (Ὁ) One goes a distance a, then a distance ὃ - c; or one goes a 
distance a - ὁ, then c. (d) a(bc) is the volume of a rectangular cylinder of 
base bc and height a. (f) a/b is the number that yields a when it is multiplied 
by ὃ. (g) change the sign; add. (h) invert; multiply. (1) Rule: add the 
vectors a and —b. (7) Divide the unit interval into ὃ parts, then lay out a 
of them. 

(k) Any number less itself is zero. Any number divided by itself is one. 
(m) Of zero length and indeterminate direction. (n) Adding a zero vector 
produces no change. (0) A vector laid out just once yields itself. (q) 2 by 
(29). (ὦ 3/2 by (19). (s) 1 by (21). (0) O by (32). (ἃ) 5000 by (21) 
and (25). (v) Oby (81). (w) 10 by (37). (x) Oby (18) and (20). (y) —45 
by (35) and (24). (z) 85 by (19), (14), (15), and (27). 


ANSWERS TO PROBLEMS 


1. By substitution of (30 - 40) for a, 10 for ὃ, and 2 for c. 2. The first. 
(14) and (15). 9. (20) and (21). When (20) is used, 0 remains and can then 
be dealt with by (22), (23), (30), or (31). When (21) is used, 1 remains and 
can be dealt with by (24), (25), or (29). Since “cancellation” is so ambiguous, 
it is better to think in terms of identities and avoid crossing out symbols 
carelessly. 


1-10 Parentheses. Consider the following alleged sentence, based on 
one appearing in a book about the stock market. 


This policy involves avoiding diversification and holding 
(1) : ᾿ ᾿ : : 
one’s capital uninvested for long periods of time. 
What did the writer mean? Did he mean to avoid both diversification 
and holding, or did he mean to avoid diversification but to hold capital 
uninvested? In everyday English, the meaning can be clarified by punc- 
tuation marks, additional words, or other devices. In mathematics, 
parentheses are used. One meaning of (1) may be expressed as follows: 


This policy involves avoiding not only diversification but 
(2) also holding one’s capital uninvested for long periods of 
time. 


This policy involves avoiding (diversification and holding 
one’s capital uninvested for long periods of time). 


(27) 


1-10] | PARENTHESES © 33 
The other meaning may be expressed as follows: 


This policy involves not only avoiding diversification but 
(3) also holding one’s capital uninvested for long periods of 
time. 


This policy involves [(avoiding diversification) and (hold- 
ing one’s capital uninvested for long periods of time)]. 


(37) 


(a) Discuss the two meanings of the following sentence and rewrite it as we 
did (1): I liked Mary and John and Bill liked me. 


A pair of parentheses in mathematics, as in (2’) and (3’), indicates that 
the expression within is to be treated as a single term with respect to any 
indicated operations. This means that the value of the entire expression 
within parentheses is to be determined before its parts are involved in 
evaluating the context in which it appears. Thus (a + b)? means that 
we evaluate a + ὃ and then square the result. 

We have already seen in Section 1-9 the way in which parentheses 
affect the meaning. Consider the formulas a + (ὃ +c) and (a+ δ) +c 
in the associative law of addition (1-9-8).* The first means to find (b + c) 
and add the result to a. The second means to find a + ὃ and add ¢ to the 
result. Fora = 2,b = 3, andc = 5, the first becomes 2 + 8, the second 
5+ 5. The associative law states that these two different procedures 
yield the same result. Because of this law, it is customary to omit paren- 
theses when only the operation of addition is involved. Similarly, because 
of (1-9-15), parentheses are omitted when only the operation of multi- 
plication is involved. 


(Ὁ) Indicate two ways of evaluating (4) - (5) - (6) by using parentheses. 


When both addition and multiplication are involved, ambiguity is pos- 
sible unless parentheses are used. Thus (2-3) - 4 ¥ 2. (3 + 4), so that 
“2:3 +4” might be ambiguous. To reduce the number of parentheses 
required, the convention has been adopted to always do multiplications 
before additions. Then 2-3 +4 = (2-3) + 4 by this agreement, and 
parentheses have to be used only if we wish to indicate 2- (3 + 4). The 
omission of the dot for multiplication suggests this convention. Thus we 
usually write (2)(3) + 4 in place of 2. 8 + 4, bringing closer the factors 
to be multiplied before the addition is performed. | 

A convention that covers all four operations is: perform multiplications 
first and then divisions in the order in which they appear from left to right; 


* (1-9-8) means formula (8) in Section 1-9. 
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then additions and subtractions in order from left to right. As examples, 
3-4/7-5 = (8. 4)Χ(7 - 5) and not 3- (4/7) -5; 3,4 -Ῥ 2 = (8/4) + 2 
and not 3/(4 -+ 2);and5 — 14+ 2 = (5 — 1) - 2andnot5 — (1 + 2). 
There is one operation that comes even before multiplication, and that is 
finding a power of a number. Thus 2-37 = 2(3”) and not (2-3)*. These 
conventions can be summed up by saying that the operations are per- 
formed in the following order: taking powers, multiplications in order, 
divisions in order, and finally additions and subtractions in order. Where 
any doubt is possible, parentheses should be used, but these conventions 
cut parentheses to a minimum. 


Evaluate: (ὁ) 1+2-8, (4) 3-4— 2:17, (6) 2: ὅ,8’ 4 — 4, 
(Ὁ 5/5 ~1, (g) 2+ 3/6, (h) (2+ 3)/6, ὦ (1+ 2253, (j) 2: 83, 
(k) —2?, ὦ (—2)2, (m) 2+3+4-—5-7+ 3/6-5, (nm) (8/6) - ὅ, 
(0) 2-4/3 — 1. 

Rewrite the following in as simple and compact a way as possible: 
(p)a-2—1+a, (q) 5+ (3+ 4), (ἡ 8/2/4, 46) 1+ a — 2, 
(0) «. -- y*-z, (u) 2:4 -- 5. 


A useful concept for talking about the meaning of operation symbols 
is the concept of scope. The scope of an operation symbol (such as the plus 
sign or multiplication dot) or a relation symbol (such as the equals sign) 
is the expression to which it applies. Thus the scope of 2 in α is a, but the 
scope of 2 in (1 + a)? is 1+ a. Also, in 2-3 +4 + 1, the scope of the 
multiplication dot is 2 on the left and 3 on the right. However, the scope 
of the plus sign is 2-3 on the left and 4 + 1 on the right, since it is 2-3 
and 4 + 1 that are to be added, not 3 and 4. 


Indicate the scope of each operation or relation symbol in the following: 
(v) (2+ 4-3 — 59/7 =a— 2c, (w) a = ὃ, and 32 = 4y — 2/37, 

(x) 1 — (8 — 1/2)/4. 

Often it is necessary to have parentheses within parentheses. When 
parentheses are nested, reading is made easier if we use different kinds of 
enclosures. The most common are the ordinary parentheses ( ), brack- 
ets [ ], braces { }, and the bar . Braces are used in this book for the 
Special purpose of designating sets, as explained in Chapter 3. The bar 
is used primarily for radicals and fractions. For example, 


(4) Va+b = V(a-+ db), 
©) ce = (πώ, Ὁ ὃ). 


In (5) the bar serves as a parentheses for both numerator and denominator 
and also as a sign of division. 


(y) What is the scope of V in (4) and of the bar in (5)? 
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The use of different kinds of enclosures is illustrated by the following: 
(6) (a+ 6)(1 — z)(2 + (¢ — 1))) = (a+ bd) — 2z)(2 + [e — 1). 


The left side is a little harder to read, but the corresponding pairs of 
parentheses can be identified in it. Indeed, corresponding pairs are sep- 
arated by none or an even number of parenthetical marks. 


(z) Identify corresponding parentheses by assigning them the same number: 
( € δ)ί( ) € DC )) 


Within reason it is better to have too many parentheses than too few. 
In science verbosity is better than ambiguity. Be as explicit as possible; 
then abbreviate if this can be done without causing doubt as to the 
meaning. 

PROBLEMS 


? 


1. Discuss “Save rags and waste paper,” and show how to use parentheses 


to give two different meanings. 


Tell whether the removal of all parentheses from the left member of each 
of the following formulas would yield a synonymous expression: 


2. (1-9-8). 3. (1-9-15). 4. (1-9-37). 5. (1-9-36). 
6. Why do we use parentheses in 2(3.08) and not in ab? 


Show that 7 through 10 are not laws. 


7. (a+ 6)? = α- b? (ἢ). 
8. (a+ b)/(e+ d) = (a+ b)/e+d (ὃ). 
9, —a+b= —(a+b) (?). 

10. Va +b = Va+vV0 (ἢ). 

11. Multiply your age by 2, add 5, and multiply the result by 50. Add the 
amount of change in your pocket or purse if it is less than one dollar. Subtract 
the number of days in the year (865) and add 115. Let A be your age, C the 
amount of change if less than one dollar, and C = 0 otherwise. Represent this 
rule in a formula. 


Do 12 and 13 as in Exercise (z). 


12. (( )Ἱ͵ κι ))C )). 18. (C ({ δ) )). 


14. The degree of spottedness of the solar disk is commonly expressed by 
a system of relative numbers known as Wolf’s sunspot numbers. To find the 
relative number we first count one for each individual spot then ten more for 
each spot or cluster of spots. Moreover, in order to compare observations made 
at various observatories by different astronomers using a variety of instruments, 
Wolf reduced them to a common basis by means of multiplying by properly 
calculated factors. Write the formula for Wolf’s number r in terms of the num- 
ber of groups and single spots g, the total number of spots observed f, and the 
reduction factor for different observing conditions k. 
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ANSWERS TO EXERCISES 


(a) I liked Mary and John, and Bill liked me = [I liked (Mary and John)] 
and (Bill liked me). I liked Mary, and John and Bill liked me = (1 liked Mary) 
and (John and Bill liked me). (Ὁ) (4° ὅ) 6 and 4: (5-6). (ce) 17. (α) 
—22. (ὁ) —19/6. (f) 0. (g) 5/2. (8) 5/6. (i) 9. (4) 18. (1) —4. 
( 4. (m) —907/30. (ὦ) 5/2. = (0) 5/8. Ο) 8a — 1. — (q)-:20/8. 
(r) 1. (s) 1/a—2. (ὃ a/zy*. (δ 8/5. 


(v) (2- 4.8 — 59/7 =a—2-c (w)a=b,and3-2 = 4-y — 2/3? 
he es St τ τος Ἐν ΑΕ, 


τι Πι τὰς - :ϑῪἋ τς «ὁ  .. 
ae eee oe ΕΣ sles whee? ὑπ|Ὸ- 
me ΠΡ a δε 13 55. Ξ Ξ ας 
oe ene pees τ τὶ 


(x) 1 — (8 — 1/2)/4 
ist Noelia τς 


ane eee 
Εἰς 
earn WA 


(y) a+ ὃ; 6 - ἃ on the left and α - 6 on the right. 


eC ( ))( ) ( )ί )}) 
1 2 2184 4 5 56 63 


ANSWERS TO PROBLEMS 


2. Yes. 3. Yes. 4. No. “—a — δ᾽ is not synonymous with “(—a)(—6).” 
5. No. 6. 23.08 ~ 2(3.08). 7. Let a = ὃ = 1. Similarly in 8 through 10. 
11. δ0[2Α + 5] + C — 365 + 115. 


12. ((( )ί JE )) }} 13. (C ( ))C )) 
123 34 45 526 61 12 3 324 41 


14. r = k(10g+ ἢ. 


1-11 Replacement and substitution. If two expressions are synony- 
mous, it is obvious that either can be used in place of the other without 
changing the meaning of any context in which they appear. This is true 
because synonymous expressions, whether constants or formulas, always 
have the same value. We call the process of putting one of two synony- 
mous terms in place of another replacement. For example, in Section 1-9 
we said that “5 + (9 — 5)” and “9” could replace each other because 
5+ 9 — 5) = 9. 

The operation of replacement is a very useful one, even though it ap- 
pears at first sight to introduce nothing new. For example, the calculation 
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in (1-9-9) through (1-9-13) consists of replacing the right member of 
(1-9-2) by an expression synonymous with it and then continuing to 
make replacements until the final desired result is obtained. 

Replacement of terms by synonyms is often helpful in deriving laws. 
For example, we can use this method to derive a + (a — a) = a from 
the laws in Section 1-9. By (1-9-22), a+ 0 = a, and by (1-9--20), 
a — a= 0. Hence in the first of these we may replace 0 by a — a to get 
the desired result. 


(a) Cite examples of replacement in previous sections and construct other 
examples. (Ὁ) Derive a(a/a) = a from the identities in Section 1-9. 


We shall use replacement so frequently that it is worth while to have 
an explicit rule to justify it. 


RULE OF REPLACEMENT: If a term in an expression is re- 
(1) placed by a synonym, the resulting expression is synonymous 
with the original. If the original is a law, so ts the result. 


Throughout this book we use “replacement” only in this sense. To il- 
lustrate the first part of the rule, 2(—a)(b) = 2(—ab), since the right 
member is obtained from the left by replacing “(+a)b” by “(—ab)”, which 
is synonymous by (1-9-35). To illustrate the second part of the rule, 
0/(a + 0) = 0 is a law because it results from (1-9-32) by replacing “a” 
by “a + 0,” which is a synonym by (1-9-22). 

Replacement is useful, but it alone will not carry us very far. To make 
good use of laws we must take advantage of the fact that they are true for 
all values of their variables. To do this we make significant substitutions 
for variables in them to get the expressions we wish. For example, if we 
substitute “2” for “a” in (1-9-31) we get the law 0-2 = 0. 

We use substitution so much that it is convenient to have a special 
notation to indicate it briefly. To indicate that an expression is to be sub- 
stituted for a variable, we write the variable followed by a colon and the 
substitute. Thus “(a:2)” means the substitution of “2” for “a.” Also, 


(2) a(a:b) = b?, 

(3) (hall)(a:i) = hill, 

(4) (a + x”)(x:a) = at a?, 

(5) (c= a*)(v:a+y) = (@+y = a?), 
(6) (x + xy)(a:2a) = 2a + 2ay, 


(7) (ab = ba)(a:3) = (3b = 03). 
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The brevity of the notation is emphasized by comparing (2) with “When 
δ is substituted for a, a? becomes b?” or with “The value of a? when the 
value of a is ὃ is b?.” (Note here the omission of some quotation marks, 
whose inclusion would make these expressions even more unwieldly.) 


Find: (c) τίσ: α), (d) y(y:a@), (e) x*(z:a%), (f) (2a)(a:2%), 
(6) (b?-+ b)(b:a), (8) (the successor of x)(z:Henry IV), () (1-9-26)(a:3).* 


It should be noted that (a:b) is the substitution of “b” for “a” every- 
where in the expression involved. This is essential, because we use this 
notation to indicate substitution for variables, and it is understood that 
a, variable cannot take different values at the same time. 

The notation is easily extended to take care of simultaneous substitu- 
tions for as many variables as we wish. To indicate that a and ὃ are to be 
substituted for x and y, we write (στα, y:b). As examples, (x — y) 
(x:a, y:b) = a — b; («& — y)(ara, y:a) = a — a; and (ὦ is a sibling of y) 
(x:John, y:Mary) = John is a sibling of Mary. Similarly, (1-9-17) 
(a:2, b:4) = [4(2/4) = 2]. 


Find: (Ὁ) (ἃ never repeats exactly)(x:a physical system); (k) (1-9-36) 
(a:2, 6:3); (ἡ (w? + x — 12)(2:3); (mm) (@? + zy + 12y?)(a:l, y:1). 


Since a law is true for all values of its variables, we get a true statement 
by substituting for the variables in any law constants that are significant 
substitutes. (Compare Section 1-8.) Such a substitution amounts to 
applying the law to a particular case. It is a matter of specialization. 

Write the following statements: (n) (1-9-7)(a:2, 6:7); Ἔὄἀ(ο) (1-9-18) 
(a:0, b:—1); (p) (1-9-14)(a:3, 6:—4); (q) (1-9-18)(6:1); (r) (1-9-15) 
(a:2, 6:5, c:—8). 

What happens when we substitute other variables for the variables in 
a law? Clearly, the result is also a law, since this just amounts to writing 
the law in terms of other symbols. Thus (1-9-8)(a:z, b:y, c:z) is another 
way of expressing (1-9-8) itself. Hence we may substitute variables for 
variables in laws in order to get laws. 

What may we say about the result of substituting a formula for a 
variable in a law? Clearly, if every value of the substituted formula 18 
also a value of the variable, the result is a law, for any substitution in 
the resulting sentence yields a value of the original sentence and hence a 
true statement. For example, in 1 -@ = a any number 15 a value of “a.” 
Now every value of “2x” is a number. Hence we may substitute “2x” 


4 


for “a” in order to obtain the law, 1-2” = 2z. 


Write the following laws: (s) (« = x)(x:a-+ ὃ); (t) (1-9-19) 
(a:z,b:x + y); (ὦ (1-9-83)(a:a + b+ ὁ). 


* Here (1-9-26) is an abbreviation for formula (26) in Section 1-9. 
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Since substitution in known laws is the most common device used in 
deriving new laws, we formulate the following rule. 


Rue oF Susstirution: If a sentence is obtained by making 
(8) significant substitutions for the variables in a law, the result 
7s a law. 


Note that significant substitutes include both constants and formulas. 
What the significant substitutes are must be determined from the context 
if no explicit indication is given. We use “substitution” only in this sense 
throughout this book. 

Almost all the work of so-called elementary algebra in the schools 
consists of making replacements and substitutions in known laws. When 
an expression is inserted by either rule, care must be taken that the ex- 
pression as a whole is subjected to the same operations as the term whose 
place it takes. This may be assured by enclosing the insertion in paren- 
theses. When any doubt is possible, parentheses should be used and then 
removed if that would cause no change in meaning. Thus (1—-9-33) 
(σα + 2) = [—[—(a + 2)] = a+ 2). However, if we did not enclose 
a-+ 2 in parentheses before substituting, we should get —(—a + 2), 
which is not equal toa + 2. Similarly, a?(a:1 + x) = (1+ 2)?. Without 
parentheses we should get 1 + 2”. 


(v) Show that α - 2 = —(—a-+ 2) and (1+ 2)? = 1+ 2? are not laws. 


The way in which the rules of substitution and replacement are used in 
calculations is illustrated in (1-9-9) through (1-9-13). There we made 
replacements, each one of which was justified by an identity resulting 
from substitution in a law. For example, we went from (1-9-10) to 
(1-9-11) by replacing “7 + 40” by “40+ 7,” which are synonyms be- 
cause 7 + 40 = 40 - 7, and we know this last is true because it is ob- 
tained by the substitution (α:7, b:40) in the commutative law of addition, 
(1-9-7). | 

As a second example, we derive a(1/a) = 1 from other identities in 
Section 1-9. First we write a/a = a(1/a) by the substitution of (b:a) in 
(1-9-19). But a/a = 1 by (1-9-21). Hence we replace a/a by 1 to get 
1 = a(1/a), which is what we wished to prove. 


(w) Why are “x = y” and “y = x” synonymous? 


It is important to understand both replacement and substitution, and 
to distinguish carefully between them. We list some of their differences. 

In replacement the inserted term must be synonymous with the original. 
In substitution the inserted term must be a significant substitute, but that 
is all. 
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Replacement of a synonym may be for any term. Substitution may be 
made only for variables. 

A symbol may be replaced by a synonym in just one of its appearances 
in an expression. A substitute for a variable must be inserted for it in 
every place where it appears. 

Because of these differences, the substitution notation should not be 
used for replacement. 

(x) Find (4:0 = 0)(a:2a). Why is this way of writing preferable to a form 
such as “Finda:-0 = Ofora = 2a”? (y) What is wrong with (2 + y:y + 2)? 
(z) Comment on the following: We know that x+y = y+ y, since it is 
obtained by substituting y for zinz-+y = y+ 2. 


PROBLEMS 


Write the substitution by which the first expression is obtained from the 
second. 


1. 1/1 = l;a/a = 1. 


2atb+c=ctatbat+b =b+ a4. 
3. cla - δ) = (a-+ b)c; ab = ba. 

4, Wat y) = 22+ 2y;a(b-+c) = ab+ ac. 
5. —2(1/3) = —2/3; a/b = a(1/b). 


6. 0/(—1) = 0; 0/a = 0. 
Write the result of making the indicated substitutions: 


7. (—(—a))(a:z + y). 

8. ((—l)a)(a:—a). 

9. ((—a)b = —(ab))(a:z + y, δ — y). 

10. ((—a)(—b) = ab)(a:x?, b:y). 

11. (a(b + c) = ab-+ ac)(a:a + ὃ, b:a, ο:---ὃ). 

12. (a(b-+ c) = ab-+ ac)(a:a + ὃ, bia, c:b). 

What substitution in what law in previous sections yields each of 13 
through 18? 

13. —(—0) = 0. 14, 2/3 — 2/3 

15. 18 +0 = 18. 16. 0- (a — ὃ) 

17. (@+ y)/@ — y) = (at y)/@ — y)). 

18. (a+b6+c¢/atb+c) = 1. 


19. Find ——* (a: R, rt + ὃ. 
, am. 


0. 
0 


1 
20. May we substitute variables for constants? 


ANSWERS TO EXERCISES 


(Ὁ) By (1-9-21) replace 1 by a/a in (1-9-25). (ὁ) a (d) α. (0) (a?)?. 
(ἢ 2x2. (g) (a?-+ a). (8) Thesuccessor of HenryIV. (i) 3+ (—3) = 0. 
(j) A physical system never repeats exactly. (k) 2(—3) = —6. (1) 0. 
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(m) 14. τ S 7a. (0) 0 — (—1) = 0+ (—(—1)). 
(p) 3(—4) = (—4)3. (ἃ) ἃ. = 1 Sa (1), (r) 2(5(—8)) = 
(2(5))(—8). (58) α- ὃ Ξ  α- ὅ. (Ὁ) e/(@-+ y) = 2(1/(e@ + y)). 


(u) —(—(a+b6+ec)) =a+b+ec. (v) (a:1) and (5:1). (w) naz«=y 
we can replace z by y and y by zx by the rule of replacement to get y = z. 
(x) @ = 2a only fora = 0, which is not what we have in mind. (y) We sub- 
stitute only for variables! (2) Substitution has not been made throughout. 


ANSWERS TO PROBLEMS 


1. (a:1). 3. (asc, b:a + ὃ). δ. (a:—2, 6:3). 7. —(—(a+ y)). 9. (—(e@+ y)) 
Gy) = =—(e- y@ ἘΞ}: 11. (a+ b)(a+ ( — b)) = (a+ b)a+ 
(a + b)(—b). 13. (—(—a) = a)(a:0). 15. (a+ 0 = a)(a:18). 
17. (a/b = a(1/b)(a:z + y, b:x — y). 20. There is no logical basis for doing 
so. Scientists try to find propositional formulas that become true statements 
when substitutions are made for’ their variables. They do this by inserting 
variables for constants in true sentences, but this is not substitution in the 
sense in which it is used in mathematics. 


1-12 The distributive law and its consequences. One of the most use- 
ful laws of arithmetic is the distributive law of multiplication. 


(1) a(b +c) = abc ae. 


(a) Verify (1) for (a:3, 6:7, c:2) and (a:2, 6:3, ¢:—1). 


The word “distributive” is used because we may think of (1) as asserting 
that multiplication may be “distributed” over addition. That is, we may 
perform the operation of multiplication either on the result of adding or 
on the two terms before adding. 


Show by substitution in (1) that: (Ὁ) 2-5+24-:2 = 2-7, (c) σία +1) = 
x? + 2x. 

The distributive law (1) is by no means obvious. It does appear plausi- 
ble, however, if we think of the products as representing areas, as indicated 
in Fig. 1-24. Of course this example does not prove (1), because, for one 


thing, it does not apply to negative values of the variables. Actually the 
property expressed by (1) is a very special one. If we interchange the 


ΞΕ 


b c 


b+e 


FIGuRE 1-24 
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operations of multiplication and addition in (1), we get a+ (ὃ - σ) = 
(a + b)- (a + 0), which is not a law. Hence addition is not distributive 
over multiplication. 


(4) Show that the equation in the above paragraph is not a law. (6) Show 
that a: (b:c) = (a+b): (α " 6) is not a law, so that multiplication is not dis- 
tributive over multiplication. (f) Similarly show thata-+ (ὃ - ὁ) = (a+ 6) + 
(a + c) is not a law, and interpret the result in words. 


By applying the rules of replacement and substitution to (1) and 
previously stated identities, we can derive a large number of useful al- 
gebraic laws. For example, we can easily show that 


(2) | (6 + eva = ba + ca. 


To derive (2) we begin by applying (a:b + c¢, b:a) to the commutative 
law of multiplication to get 


(3) (b+ cla = a(b+ ὁ). 
Because of (1) we may replace the right member of this by ab + ac to get 
(4) (b+ cla = ab + ace. 
By the commutative law of multiplication we may replace ab by ba and 


ac by ca in (4) in order to get (2) as desired. 


(g) What substitution in the commutative law of multiplication justifies 
replacing ac by ca? 


The above argument for (2) can be abbreviated conveniently as follows: 


(32) (b+ οδα = α(ὖ -Ἡ ὁ) (ab = ba)(a:b + ¢, b:a) 
(4’) = ab +ac (1) 
(5) = ba + ca (1-9-14) and (1-9-14) (b:c).* 


Note that each step was obtained from that above by a replacement, and 
that each replacement was justified by a law previously stated or obtained 
by substitution in such a law. The laws (1) and (2) are often called re- 
spectively the left and right distributive laws. 


_(h) Justify 2. -ἰ- 75 = 9x by substitution in the right distributive law. 
(i) Derive (6) from (1). 


(6) a(b + e+ ἃ) = ab+ac+ ad. 


* Although we sometimes use numerals as abbreviations for laws, the student 
should always write the laws themselves as we did in step (3). 
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Among the many consequences of the distributive law, we list the fol- 
lowing useful identities. 


(7) (a+ δ)(ο + d) = ac+ad-+ be + bd, 
(8) —(a + ὃ) = (a) + (5), 

(9) (a + b)? = a? + 2ab + b?, 

(10) (a — b)? = a? — 2ab + b?, 

(11) (a + b)(a — ὃ) = α — B?, 

(12) (a + b)? = a® + 3a7b + 3ab? + b3, 
(13) (a + b)(a® — ab + 05) = a? 4 B3, 
(14) (a — δ)(α" + ab + 6?) = α — B?. 


Proofs are similar to that given for (2) above. For example, we may 
derive (8) as follows: 


(15) —(a - δ) = (—1)(a+ ὃ) (—a = (—1)a)(a:a + δ) 
(16) =f Ae ayy ΠΡ 
(17) = (—a) + (--ῦ) (—a = (—1)a)(a:b). 


Similarly, (11) is obtained as follows: 


(18) (a+ b)(a — b) = (a+ δ)ία + (--ὃ)) (1-9-18) 
(19) = aa + ba + a(—b) + b(—D) (7) 

(20) = a?+ba+ (—ab) + (—bd)? (1-9-36) 
(21) =a? —b? (1-9-14), (1-9-20), (1-9-22). | 


(j) Explain the reasons in (18) through (21), indicating the substitutions. 
(k) Complete the following: “(8) expresses the rule that parentheses preceded 
by 8 minus sign may be removed provided_____..” (1) Use (11) to easily 
find 12517 — 12507. (τ) Check (10) for (a:10, 6:9). (ὦ) Show that (14) 
is not an identity if the first minus sign is displaced by a plus sign. 


When an expression in the form of the right member of one of (1), (2), 
(6) through (14), or similar identities, is rewritten in the form of the left 
member, it is said to be factored. When it is originally in the form of a 
left member and is rewritten in the form of a right member, it is said to be 
expanded. Problems in factoring or expanding consist simply of making 
the proper substitution in some law so as to get the given expression on one 
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side of an equation and the desired form on the other. For example, to 
factor x? — 4a” we note that it is in the form of the right member of (11). 
Hence we apply (a:2, b:2a) to (11) to get x? — 4a? = (x + 2α)( — 2a). 
On the other hand, to expand (3 — x)* we note that it is in the form of the 
left member of (10). Hence we apply (a:3, b:x) to (10) to get (8 — 2)? = 
9 — 6a + x’. 

Write out: (0) (9)(a:82, 6:1), (p) (6)(a:x, b:1, c:x, d:2), (q) (7) 
(a:a + y, b:z, cry, d:1). 

Show by substitution: (ὦ) in (11) that (2 — 2y)(z+ 2y) = «2 — 43, 
(8) in (14) that 93 — 73 = (9 -- 7)(81 + 126 + 49). 

Expand, indicating identities and substitutions used: (Ὁ (2 — y)%, 
(u) (2a + 1)(4.3 — 22+ 1), (v) Ο — a) + a). 

Factor, indicating identities and substitutions used: (w) 36 — 02, 
(x) 1 — 2723, (y) 2? -- 22- 1, (2) 92? — y? 


The distributive law and related identities are the basis of the rules of 
multiplication. For example, 


(22)  (25)(37) = (20 + 5)(30 + 7) = 600 + 140 + 150 + 35 


by (7)(a:20, b:5, c:30, d:7). The terms on the right are simply the numbers 
obtained if we carry out the multiplication in the usual way, 


37 or 37 


2 sz 
35 185 
(23) 150 740 
140 925 
600 
925 


The familiar right-hand form in (23) is simply an abbreviated version of 
the left, and this in turn is just a different arrangement of the terms in the 
right member of (22). 


PROBLEMS 


Expand, indicating laws and substitutions: 


1. 2(5 — 3). 2. x(a -+ 1). 3. (4a — B)b. 

4. —(x — y). 5. (2a + 3b)*. 6. (vx + 8) — 4). 
Fide). 8. (x + 1)α — 1). 
Factor Problems 9 through 16, indicating laws and substitutions: 

9. 5a — 42. 10. 2a -+ ὅδ. 11. 22+ 22+ 1. 
12. 2? — 4x-+ 4. 13. 4 — 2?. 14. 2? — 1. 


15. 2? — 6ry + 9y?. 16. 2* + 824 2. 
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17. Simplify ([(85 — 4)¢-+ 2]. — 2)x-+ 5. Note: In deriving an identity, 
work with one side by making replacements until you transform it to the other 
side, as we did, for example, in (15) through (17). Or, you may work with first 
one side and then the other until you have shown each side synonymous with 
the same third expression. However, do not state what you are trying to estab- 
lish as a step in your argument until the end, since this is what you wish to 
show and not what you are given. 

18. Show that 4ab + (a — b)* = (a+ b)?. (For a geometric interpreta- 
tion of this and other identities, see An Introduction to the History of Mathe- 
matics, by Howard Eves, p. 63 ff.) | 

19. Show that (mq — sp)? — m(p — sq)? = (82 — m)(p? — mg?). (This 
identity may be used to show that ~/m is irrational unless m is the square of 
an integer.) 


An odd number is one that can be expressed in the form 2x — 1 where x 
is ἃ positive integer, and an even number is one that can be expressed in the 
form 2x where 2 is a positive integer. In Problems 20 through 24 show that: 


20. The sum of two odd numbers is even. 

21. n? + (n — 1)? and n? — (n — 1)? are always odd, where n is a positive 
integer. 

22. The product of two consecutive integers is always even. 
* 23. The sum of three consecutive integers is always divisible by 3. 
* 24. n? — nis always divisible by 6, where n is a positive integer. 


* 25. Suppose a rope is stretched tight around the circumference of Mars and 
then 10 feet is added to its length. How much would the radius of Mars have to 
increase in order for the rope to fit tightly? (Suggestion: Let r = the radius of 
Mars. Then r-+ x = the new required radius. Then the difference of old and 
new circumferences is 10. Express this and solve for z.) 

26. A sociologist constructed an integration index by combining a crime 
index C and a welfare-effort index E as follows. The crime scores were “re- 
versed” by subtracting them from 20, in order to make high values indicate 
strong integration. These reversed crime values were multiplied by 2 to give 
them double weight. The values of the welfare-effort index were transformed 
by the formula Ε΄ = Μὶ + (s1/s2)(E — Me), where Mi, Mo, 81, s2 were 
determined in a specified way. The two modified scores were then added and 
divided by 3. (The Language of Social Research, by P. F. Lazarsfeld and M. 
Rosenberg, p. 61.) Write a formula for the integration index in terms of E and C. 

27. Show that (a? + b?)(x? + y?) = (ax+ by)? + (bx — ay)?. 

*28. Show that (a? + 6? + c?+ d?)(x? + y? + 22+ #?) is equal to a sum 
of squares. 

29. Show that (a) (2uv)? + (uw? — v?)? = (u2?+ υ22. (b) (2m)? + 
(m? — 1)? = (m?+1)?. (Note: For any substitution, these formulas yield 
the Pythagorean relation between the squares of the sides of a right triangle. 
Hence they can be used to find such triangles. The second formula is credited 
to Plato. See the reference given in Problem 18.) 


Multiplication can be done mentally by thinking directly in terms of (7) 
and holding in the memory only the totals. Thus one might think of (25)(37) 
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as (20 + δ) - (30-+ 7), then think 20-30 is 600, plus 20-7 (or 140) is 740, 
plus ὅ - 30 (or 150) is 890, plus 5-7 (or 35) is 925. It is methods of this sort 
that enable people to calculate rapidly without pencil and paper. Calculate 
the following mentally. 


*30. 1002 — 972. #31. (24) (36). #32. 252 — 28, 
#33. 322. #34. (15)(139). *35. (99) (32). 


*%36. During the performance of a “lightning calculator” a member of the 
audience decided to embarrass the performer by asking him to multiply 5362 
by 9,999,999,999, but the speaker gave the answer almost instantaneously. 
How did he do it? 

ANSWERS TO EXERCISES 


(a) 8(7 + 2) = 3-9 = 27; 3°-74+3-+2 = 2146 = 27. 2(8 — 1) = 
2.2 -- 4.2.8 -Ὁ 2(-1) =6—2= 4. (b) (az, 6:5, 6:2). 
(c) (a:x, biz, 6:1). (4) through (Ὁ) Substitute constants to get a false sen- 
tence. (6) (bic). (h) (a:2, 6:2, 0:7). (Ὁ α( -Ἡ e+ d) = ab+ale+d) = 
ab + ac-+ ad by [(1)(c:c + d)] and [(1)(b:c, e:d)]. () (18): No sub.; (19): 
(c:a, d:—b); (20): (a:b); (21): ba replaced by ab, then (1-9-20}(a:ab), (1-9-22) 
(α:α3). (Κ) The sign of each term within is changed. (1) (1251 + 1250) 
(1251 — 1250) = (2501)(1). (αὐ Find the value of each side separately. 

(n) By substituting constants. (0) (83a-+ 1)? = 927+ 644+ 1. 
(p) ad tet2) =2+2?4+22, (ᾳ4φ @tytayt)) = ὦ -Ῥ ψὴν Ὁ 
(c+ y)+ayt oe. (r) (aia, b:2y). (s) (a:9, 6:7). (0) (10)(a:x, b:y). 
(u) (13)(a:22, 8:1). (ν᾿) (11)(a:3, b:a). = (w) ((1)(α:6). (x) (14) (a1, 6:32). 
(y) (10)(a:z, b:1). (4) (11) (a:32, bsy). 


ANSWERS TO PROBLEMS 


1. 22 — 6. 3. 4ab — b?. δ. 4a2 + 12ab + 902. 7. 2% — 221 -Ἐ 1. 
9, χίδ — 4). 11. @+1)2. 18. (2 —2)(2+2). 15. (« — By)?. 17. 8xt — 
473 + 2χ2 — 22+ 5. 


1-13 Definitions. As pointed out in Section 1-1, we must agree on 
what our terms mean if we are to communicate effectively. Hence we 
should like to define all our terms. But is this possible? To define a word 
we must use other words. If we try to define all our terms, we must define 
the terms we use in defining them. Then these terms have to be defined in 
turn. We shall go on endlessly in this fashion, or else we shall come around 
in a full circle by directly or indirectly defining one symbol in terms of a 
second and then the second in terms of the first. Thus, if we wish to define 
all our terms, circularity is inevitable. 


Not long ago a reviewer wrote that the editors of a well-known dictionary 
“held to one laudable ambition, to put an end to definitions couched in words 
that require reference to another definition, and another, until you land right 
back where you started.” (a) In view of the fact that no word is undefined 
in a dictionary, comment on this ambition. (Ὁ) Suppose that a language 
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contains just five words and that at least one different word must be used in 
defining a word. In a dictionary of this language how many words at most 
could be looked up without “landing right back where you started,” that is, 
- without finding in a definition a word already looked up? (c) Would the 
same sort of argument hold for a language of 100,000 words? (d) Experiment 
with an English dictionary by looking up a word, then all the words in its 
definition, and so on. 


If it is impossible to define all our terms, how do we ever find out what 
terms mean? The answer is, of course, that we learn the meaning of a 
basic vocabulary by seeing how terms are used and by having objects 
pointed out as the values of certain terms. Knowing the meanings of these 
basic terms, we learn others from the dictionary and from the contexts 
in which they appear. If everyone had the same basic vocabulary, it 
could be omitted from dictionaries, and then circularities could be avoided. 
In mathematics such a procedure is possible, because we are constructing 
a language rather than describing one that has developed spontaneously. 
Accordingly, we can choose certain terms as undefined and define others 
m terms of these. 

Terms that are not defined in a mathematical theory are called basic 
terms of that theory. Basic terms are used to define other terms, but are 
not defined within the theory itself. It does not follow, however, that we 
have no idea what our basic terms mean. On the contrary, we usually 
have some more or less definite idea of their significance. It is just that 
we must start with some terms as undefined if we are to avoid circularity 
or. an endless series of definitions. For example, in this chapter we have 
been defining various special terms by using ordinary words of everyday 
language. These ordinary words are undefined in this book. 

In formal mathematical theories we specify what terms are to be basic. 
They may include constants, variables, and formulas. Other terms are 
defined by stating that they are synonyms of expressions that are either 
basic terms or previously defined terms. The new term that is defined is 
called the defined term. The term that is used to define it is called the defin- 
ing term. The definition is made by stating that the defined term is a 
synonym of the defining term, i.e., in the form “defined term = defining 
term.” For example, if we take “a - b” as a basic term, we may define “a?” 
by writing a? = a-a. To indicate that this is a law by definition, we may 
write = q in place of = or precede the definition by “Let” or “Def.” The 
following all mean that “a?” is defined by “a- a.” 


(1) Let a? = a-a. 
(2) a” =ga-a. 
(3) Def. α" = a-a. 


The first form is used for temporary definitions. 
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What is the meaning of this definition of a”, and how did it arise? Does 
a? equal a: a and nothing else? Could a* have been defined differently? 
The meaning of (1) through (3) is simply that we agree to write α as an 
abbreviation for a- a. We could have chosen, instead, to write a sq, 2a, 
or 2%. In fact, the symbol a? is of relatively recent invention, and many 
other symbols were used before a? came to be generally adopted. If we 
used some other symbol for a: a, we could then use a” with a different 
meaning. For example, we could let a? = a + a. Then it would turn out 
that a? = 2aisalaw. We do not do this because the use of α as a synonym 
for α - a turns out to be more convenient. The fact is that words and other 
symbols are man-made and can be used as we choose. Of course, common 
words have well-accepted meanings, and some confusion would be caused 
by using them with unusual meanings. But even common words are used 
with different meanings in different contexts and change their meaning 
with time. So long as we make our meaning clear, we are quite free to use 
words and other symbols as we wish. 


(e) At the conclusion of an astronomy lecture, one of the audience asked: 
“T can understand how astronomers found out the distances, sizes, and composi- 
tions of the stars, but how could they possibly discover their names?” What 
fallacy is involved? (ἢ) According to Our Language, by S. Potter, King James 
II described the great St. Paul’s Cathedral built by Sir Christopher Wren as 
“amusing,” “awful,” and “artificial.” Was the King lacking in aesthetic 
appreciation? 


In mathematics a definition is simply an agreement as to how a symbol 
is to be used. It is a sentence that gives to a symbol, previously without 
meaning, the same meaning as a term already part of the theory. When the 
definition is given in the form “defined term = defining term,” it is called 
a formal definition. When it is given in some other way it is called informal. 
The definitions in (1) through (3) are formal, whereas the definitions we 
have been giving of words printed in italics are informal. We could express 
(1) informally by saying, “We call the product of a number by itself the 
square of that number and write a? for a- a.” Formal or informal defint- 
tions are both used in mathematics, depending upon which is convenient 
in a given context. 


(g) Write a formal definition of 23, assuming that the meaning of x? is known. 
(h) Why is it foolish to ask for a proof of a2 = a-a? (i) Why are all defini- 
tions laws? (ἡ) Why are defined terms usually shorter than defining terms? 
(k) In a mathematical theory we could dispense with all defined terms and 
express everything by using only basic terms. Why? Why is this not done? 
(1) In the formal definition “triangle =a three points not in a straight line 
and the segments joining them,” what geometrical terms are assumed basic or 
previously defined? 
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A formal definition is easily recognized, but most definitions are informal. 
Often they are stated in a way that does not make clear which term is 
defined and which defining. They may even be asserted as though they 
were statements of fact instead of merely agreements about the use of 
symbols. Consider, for example, the following formal definition of “x is 
parallel to y.” 


(4) x || y =a2 and y are lines in the same plane and x does not 
meet y. 


Here “zx || y” is obviously the defined term, and the right member is the 
defining term. Every time we see “z || y” we can mentally replace it by “x 
and y are lines in the same plane and x does not meet y.” However, this 
definition may appear in many forms, of which we give ἃ few. 


(5) We define parallel lines as those that do not meet. 


This has the disadvantage of seeming to define the objects, parallel lines, 
whereas we are really defining the term “parallel lines.” 


(6) If two lines do not meet we call them parallel. 


Here the phrase “we call” is the indication that the sentence is a definition, 
but the statement appears to leave open the possibility that we might 
still call them parallel even if they did meet! It also leaves implicit the 
idea that the lines must lie in the same plane, as did (5). 


(7) If lines do not meet, they are parallel. 


This has the same drawbacks as (6) and, in addition, sounds like a state- 
ment of fact (that might conceivably be wrong), whereas it is actually 
intended to be true by definition. 

In reading scientific literature, it is usually necessary for us to guess 
from the context whether or not a statement is a definition and, if 80, 
what is the defined term. Alertness on this score saves much time that 
would otherwise be wasted by trying to see why a supposed claim is true, 
when it is merely a definition of one of the writer’s terms. 


In each of the following, identify the defined term, criticize, reformulate as a 
formal definition, and give, if possible, a better informal definition: (m) A 
triangle with three equal sides is equilateral. (n) A political plan that cannot 
secure the results it envisages is utopian. (0) A leaf with an even margin is 
said to be entire. 
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Often the meaning of a term is indicated intentionally or unintention- ᾿ 
ally only by the context, without any direct statement of its meaning. 
For example, in Section 1-4 we indicated the meaning of “factor” by the 
way in which we used it to describe multiplication. Of course, guessing 
the meaning of a word from its context may be risky, as illustrated by the 
schoolboy who said that an average was a “rack on which chickens lay 
eggs” because he had read somewhere that chickens lay one egg a day on 
an average! 


Give a formal and an informal definition of the italicized term in each of the 
following: (p) Factor. (q) The magician seeks to control the forces of nature 
in an unnatural or in an unusual way. (1) Avoid overcooking, since cooking 
food too long spoils it. 


In formulating a definition we must decide on both the defined and 
defining terms. Even if we know precisely the expression for which we 
want to adopt an abbreviation, the problem of picking the appropriate 
defined term is usually by no means easy. Some points to consider are the 
following: 


The defined term should not have been previously defined in the 
same context. 


(8) 


The defined term should be either lacking in previous meaning 
(9) in the context or have a previous meaning and associations not 
inconsistent with the defining term. 


The defined term should, if possible, seem to be a natural 


(10) choice. 


(11) The defined term should be conventent to use. 


When it comes to choosing words as defined terms, a compromise among 
these desirable features is usually necessary. A newly coined word always 
satisfies (8) and (9), but often fails to satisfy (10) and (11), while old words 
seldom satisfy (9) completely. For example, in choosing “symbol” as a 
synonym for “any intentional mark” in Section 1-3 we considered other 
words, such as “sign,” and came to the conclusion that “symbol” satisfied 
these conditions better than any other existing word, even though the 
exact meaning we gave to it was somewhat broader than its dictionary 
definition. 

When we choose defined terms of nonverbal mathematics, we are not 
handicapped by previous meanings and can often invent an entirely new 
symbol that satisfies (8) and (9) and seems to have a good likelihood of 
satisfying (10) and (11). As to (11), we cannot tell whether a term will 
be convenient to use until we try it. For this reason it frequently happens 
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that various terms are used with the same meaning in mathematics. 
Sometimes one finally emerges as the best, and sometimes several remain 
in use permanently, each in a context where it is most convenient. 


(s) Give an example of two mathematical symbols that designate the same 
arithmetical operation. (t) Why is “a?” superior to “a cubed”? 


There is no problem of choosing a defining term if we already have an | 
expression for which we wish to adopt an abbreviation. Such is often the 
case in mathematical theories. Usually, however, we have only a more or 
less clear idea of something for which we wish to adopt a term. In such 
cases the problem is to formulate clearly a description of the thing to be 
named, then use this as the defining term. For example, in defining 
“symbol” in Section 1-3 we inserted “intentional,” since otherwise acci- 
dental marks would be included as symbols, and we wished to include 
only marks made intentionally by man. On the other hand we did not 
specify “marks on paper” or “printed marks” because we did not wish to 
limit the meaning of the term in this way. The following may be helpful 
in formulating defining terms. 


The defining term should contain only undefined terms and 


(12) previously defined terms. 
(13) The defining term should have a clear meaning. 
The defining term should name precisely what is intended, 
(14) neither including unwanted items nor excluding ones that are 
desired. 
(15) The defining term for a formula must make clear how its 


values may be found. 


These suggestions are intended to guarantee that the defining term is 
actually a term in the sense of Section 1-5. Definitions that conform to 
(15) are called operational. For example, a? =ga-a is operational, 
because the right member tells us how to compute a? for every value of a. 


(u) Criticize “νῷ = the square root of x” as a definition of να. (νὴ) Which 
of the following is operational? “Economic progress is any improvement in 
economic welfare.” “Economic progress is the increase over time of per capita 
goods and services.” (w) Discuss the following two definitions: “National 
welfare = gross national product as determined by federal statistics.” “National 
welfare = the happiness of the people.” (x) Which criterion is not satisfied 
by the following: “A square is a quadrilateral with right angles at all its 
vertices.”?  (y) Criticize: “A regular polygon is one with equal sides.” 


We close this section with two definitions that will illustrate some of the 
ideas discussed above. 
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You are already familiar with the symbol for square root, ν΄. How can 
we formulate a precise definition? Why is “Ό = 3? Because 9 = 37. 
This suggests the definition (y = Vx) =a (y* = 2). But this is un- 
satisfactory because both (—3)? = 9 and (3)? = 9, so that “ν θ᾽ might 
mean either 3 or —3. Hence with this definition “\/x” would not be a 
formula. To eliminate the ambiguity, we define “\/x” to mean the non- 
negative number whose square is x; that is, 


(16) Def. (y= Vz) = ty? = σι διὰ (y = Oory > 0)). 


With definition (16) we can say that any positive number x has two square 
roots, «/z and —+/z. We call ~/z the principal square root. For the present 
we exclude negative values of x in \/z. 

In Section 1-4 we gave a very informal definition of |a| in geonietine 
terms, saying that it represented the length of the vector a. This was 
not very satisfactory, since there was no precise specification of what 
we meant by length! A more satisfactory operational definition is the 
following. 


(7) Det ja] = a whena > Oora= 0. 
ef. | 
|a| = —awhena < 0. 


Note that the definition has two parts, one specifying how to find | a | 
when a > 0 or a = 0, the other when a < 0. This definition conforms 
to the remarks in Section 1—4 at the end of the paragraph dealing with 
absolute value. 


(z) Hlustrate (17) fora = —10. 


There are many other definitions in elementary algebra. Our purpose 
here was to include a few in order to illustrate the following points: 


(a) Definitions are arbitrary, but they are chosen tn order to 
facilitate calculation and communication. 


(b) Any expression is meaningless and not even a term 


15) until τὲ 18 introduced as basic or defined. 


(c) A formula is defined only for certain values of its varia- 
bles, and is meaningless for other values. 


PROBLEMS 


1. Why are there no circularities in an English-French dictionary? 

2. Which is correct usage: “Define an elephant” or “Define ‘an elephant’”? 

3. Discuss from the point of view of this section one or more of the definitions 
of words given in the book so far. 


Give precise definitions of the italicized terms in Problems 4 through 7 on the 
basis of the context. 
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*4. “To refer to the whole political organization of a particular people inhabit- 
ing a definite territory and ruled by a distinct governmental establishment a 
broader term the state is employed.” (G. L. Field, “Toward a More Objective 
Definition of Political Concepts,” The Southwestern Social Science Quarterly, 
July, 1946) 

*5. “To adopt a term as primitive is to introduce it into a system without 
definining it.” (N. Goodman, The Structure of Appearance, 1951, p. 56) 

*6. The classical writers distinguished between chronicle and history. The 
former records what was done and in what year it happened; history must exhibit 
also the reason and cause of events. 

*7. “... the technique of measuring group morale designed by L. D. Zeleny. 
In its simplest form the technique consists in determining the number of attrac- 
tions or ‘likes’ that are found in a group and expressing it as a ratio of the total 
number of attractions which would be theoretically possible in that group, i.e., 
if each member were attracted to each other member.” (P. F. Lazarsfeld and 
M. Rosenberg, The Language of Social Research, p. 24) 


Answer Problems 8 and 9 with reference to definition (16). 
8. What is /0 and why? 
9. Why is V16 ~# —4? 
10. If a > 0, what is Va?? If a < 0, what is Va?? Show that 


(19) (Va)? = a, 
(20) Va? = fal, 
(21) Va > 0. 
11. Show that 
(22) Vavb = ν αὖ and Va/Vb = Va/b. 


12. Use (22) to simplify (a) V2-9, (b) V18, (ὁ) V8, (ὦ V/4a3, (e) V8a2, 
(f) Vai, 

13. Simplify the following: (4) (a+ Vb)(a— Vb), (Ὁ) (a+ ν) 3, 
(c) ("8 — V2)?, (d) (Vz — ν 2). 

14. Is there any real number whose square is —4? Is there any real number 
whose square is 2, if 2 < 0? Why? 

Answer Problems 15 through 17 with reference to definition (17). 


15. What is [0] and why? 
16. Does “/a|” mean that a is positive? 
17. Why is |a?| = a? a law? Why is |z| = x not a law? 


18. Show that 


(23) [α|}Ὁ] = [αὖ], 
(24) la| = |—al, 
(25) [α > 0. 


19. Define “z%” and “zx is 2% of y.” 
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ANSWERS TO EXERCISES 


(a) Presumably their ambition was to use common words in defining uncom- 
mon ones, a very laudable procedure. However, in defining the common ones 
they would then have to “land you right back where you started.” (Ὁ) Suppose 
the words are a, δ, c,d, and e. Then ὃ has to be used to define a. If we are not 
to “land right back,” a different word must be used to define ὃ, say 6. Then still 
another must be used to define c. In this way d is used and e must be used 
to define it. Now we cannot avoid using a word again. So when we look up 
the fifth word, we get a circularity. Of course, we might get one sooner. 
(c) Yes. (4) Circularity usually occurs in only two or three steps. (6) As- 
tronomers assigned the names. (f) In his time, amusing = pleasing, awful = 
inspiring awe, and artificial = well constructed. 

(g) a3 =g a-a?; Let a? = a-a?; or Def. a? = a-a*. (8) Because it is 
true by our decision as to the meaning of the symbols. (i) Same as (h). 
(j) They are introduced for brevity and convenience. (k) Because we could 
replace all defined terms by their synonyms. It would make for awkward 
communication. (1) “Point,” “straight line,” “segments,” and “joining.” 
(τ) Equilateral triangle =a triangle with equal sides. (n) Utopian plan =q 
plan that cannot secure the results it envisages. (0) Entire leaf =a leaf 
with an even margin. Note that in some contexts “entire leaf ” would mean 
simply a whole leaf, but this is no serious drawback, since the context would 
indicate the meaning. 

(p) Factor = the value of a term that appears as does “a” or “δ᾽ in "α΄ ὃ." 
(q) A magician =a a person who seeks ἴο.... (Ὁ) Overcooking =g spoiling 
food by cooking it toolong. (8) -andX;or/and~+. (Ὁ) Easier to manipulate; 
shorter. (u) Does not help in that one does not know what “square root of x” 
means. Gives no means of finding Vz. (νὴ) Second. It is also perhaps too 
narrow. (Ὁ) First may be too narrow, but is operational. Second is non- 
operational. (x) Too broad, since it includes rectangles that are not square. 
(y) Too broad, since it includes polygons with unequal interior angles. 
(z) —10 < 0. Hence we substitute (a:—10) in the second part of the definition 
to get |—10] = —(—10) = 10, as expected. 


ANSWERS TO PROBLEMS 


1. English terms are all undefined. 2. Second. 4. A state =a the whole 


political organization . . . by a distinct governmental establishment. 5. Primi- 
tive term =q a term that is undefined. 8. 0 since 0? = Oand0 = 0. 9. —4 
isnot > θοῦ = 0. 10. Va2 = a when a > O or a = 0; Va? = —a when 


a < 0. (20) is true since the right members of these two equations are the same 
asin (17). 11. (Wav)? = VavbVavb = (Va)2(Wb)? = aband VaVb > 0 
or = 0. 12. (a) 3V2, (b) 8V2, (Ὁ 2V2, (4) 2lalVa, (e) 2lalV2, (ἢ a?. 
13. (a) a2 —b, (b) a2?+2avb+5, (Ὁ 1—2V6, (d) τῷ — 2V2). 
19. 2% =a 2/100; x is 2% of y =a (x/y = 2/100). 
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1-14 Fractions. A fraction is an expression of the form a/b. The 
properties of fractions are consequences of the following definition. 


(1) Def. (a/b = c) = (a = be). 


Is this an adequate definition in terms of the criteria of the previous sec- 
tion? Does it enable us to find a/b for any numbers a and b? We know by 
experience that it does in many cases, since the methods of division we 
learned in school consist of taking trial divisors and “multiplying back.” 
For example, we know that 63/7 = 9 because 63 = 7.9. 

It is not hard to see that (1) does not define a/b for b = 0, for the sub- 
stitution (6:0) in (1) yields [4/0 = c] = [a = Ο- ε΄. Now 0-c= 0. 
Hence if b = 0 and a = 0, there is no value of c that makes the right 
member of (1) true, and hence no value of a/b is determined by (1). On 
the other hand, if a = ὃ = 0, the right side of (1) becomes 0 = 0-c. 
But this is satisfied by any value of c. Hence in this case (1) fails to define 
a unique value of a/b. We have shown that a/b is undefined for b = 0. 
Hence we say that division by zero is undefined, and we exclude 0 as a 
value of any denominator. Does (1) define a/b for all nonzero values of 
δ᾽ The answer is yes, but we postpone a proof until Section 1-16. 

From (1) we can easily derive the laws in Section 1-9 that dealt with 
fractions. For example, (1)(c:a/b) is [a/b = a/b] = [a = b(a/b)]. The 
left member is obvious; the right is just (1-9-17). Now (1-9-27) follows 
by the substitution (a:1, b:a) in (1-9-17). Returning to (1), the sub- 
stitution (c:a(1/b)) yields [a/b = a(1/b)] = [a = b(a(1/b))]. But the 
right member is just a = a, since b(a(1/b)) = b(1/b)a = a by rear- 
rangement and (1—-9-17). Since the left member is (1-9-19), that law is 
proved. 


(a) Prove (1-9-21) by substitution in (1). (You may, of course, assume 
the identities having to do with multiplication.) (Ὁ) Justify (—8)/(—2) = 4 
from (1). 


(2) ca/cb = ab. 


Identity (2) says that the value of a fraction is unchanged if its numera- 
tor and denominator are multiplied (or divided) by the same term. It 
follows from (1) (a:ca, b:cb) since (cb)(a/b) = c(b(a/b)) = ca by re- 
arrangement and (1-9-17). It can be used to great advantage in simplifying 
fractions. For example, it is the law that permits us to write 9/24 = 3 /8, 
since 9/24 = 3-3/3-+-8 = 3/8. The last equation is obtained from (2) by 
(a:3, 6:8, ¢:3). 


(3) a/d + b/d = (a+ b)/d. 
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Law (3) tells us how to add fractions with the same denominator. We 
do not need a separate law for fractions with different denominators, 
because we may always write fractions so that they have any desired 
denominators by using (2). For example, to add 1/2 and 5/6, we cannot 
use (3) directly since the denominators are not the same. However, by 
(2) 1/2 = 3/6. Hence 


(4) 1/2+5/6 = 3/6+ 5/6 (2) 
(5) = (3+ 5)/6 ὦ) 
(6) = 8/6 8 - ὅ -- 8, 
(7) = 4/3 (2). 


Of course, we would not think of writing out all the details of such a simple 
calculation except to show how each step depends on the application of a 
law. 


(c) Use (2) to simplify 1/(1/2). (4) Indicate the substitutions used in the 
laws indicated in steps (4), (5), and (7). (6) How was the rule of replace- 
ment used in (4) through (7)? 

Perform the following as above, indicating the laws and substitutions used: 
(ἢ 2/3+ 5/6. (ἡ 1/3+ 4/5. (h) 4/5+ 1/6+ 2/3. (i) Prove (3) by 
(1) (a:a + ὃ, b:d, c:a/d + b/d). (Ὁ) Show that a/b -+ c/d = (ad + be)/bd. 


The following laws cover multiplication and division of fractions. 


(8) (a/b)(c/d) = ac/bd, 
(9) (a/b)(b/a) = 1, 
(10) 1/(a/b) = 6/a, 
(11) (a/b) /(c/d) = ad/be. 


The first follows from (1)(a:ac, b:bd, c:(a/b)(c/d)), since bd(a/b)(c/d) = 
b(a/b)d(c/d) = ac by rearrangement and a double application of (1-9-17). 
Then (9) follows by applying (8) to get (a/b)(b/a) = ab/ba = 1, the 
last equality being justified by rearrangement and (a/a = 1)(a:ab). 
Then (10) follows from (1)(a:1, b:a/b, c:b/a) and (9), and (11) follows 
from (1)(a:a/b, b:c/d, c:ad/be) and (2). 

Simplify, indicating laws and substitutions: (k) 27/30, () (3/4)(2/3), 
(m) (3/4)/(2/3), (ὦ) 1/(1/8), 0) (—2/8)(3/(—2)). 


For fractions involving minus signs, the following are useful. 


(12) a/b = (—a)/(—6), 
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(13) —(a/b) = (—a)/b 
(14) | = a/(—d). 


Simplify, indicating laws and substitutions: (p) (—2)/(—3), (q) (—7)/2, 
() 8/(—4), (5) 0,)(--ὃ),, (ἢ) (—2)(—3)(—27)/(—9). 


By using the identities of this and previous sections we can perform all 
the manipulations needed to rewrite algebraic expressions in any desired 
form (provided, of course, that the desired form is synonymous with the 
given expression!). In particular, by using (2) and the distributive law 
we can easily manipulate fractions. For example, 


z—1/3 | 6(@ — 1/8) 
τη 3/2 6(@ + 3/2) 
_ 6% — 2 
— 64 +9 
_ 2(82 — 1) 
᾿ς ϑ(ῶ2 + 38) 
(18) = (2/3)((8x — 1)/(2e + 3)) (8. 
Which one of these synonymous expressions is considered the simplest 


depends on the situation. 


Simplify: (u) (ὦ + 2y)/(2y + 2), (v) (8% — 2)/(2 — 32), 
(w) 1/(a@ — b) + 1/2, (x) (8/4 — 1/z)/(2@ - 1/2), ( 1/(1/2). 


Just as the laws having to do with division follow from (1), so those 
having to do with subtraction follow from the next definition. 


(15) (2)(a:z — 1/8, δ: + 3/2, c:6) 


(16) (Distributive law) 


(17) (Distributive law) 


(19) Def. l@a—b=c]=fa=b+¢]. 


The corresponding laws are paired in Section 1-9. Of course, 0 plays the 
role for subtraction that 1 does for division. We leave the development 
of this idea to the reader (see Problems 20 and 21). 


PROBLEMS 
Show that 1, 2, and 3 are not laws. 
1. 1/a+1/b = 1/(a+ δ). 
2. (1/a+ 1/6)? = 1/a? + 1/62. 
3. 1/(1/a+ 1/b) = a+b. 
4. Simplify (1/a — 1/b)/(1/a + 1/0) without adding the terms in numerator 


and denominator. 
5. Prove (1-9-21), (1-9-29), and (1-9-32) by substitutions in (1). 
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Show that 

a+tb,a—b _ atb a—b _ 
6. 9 i 9 = a, “τὰν τα 5 = b. 
8. a+ b/c = (ac-+ b)/c. 9. a/b — 1 = (a — δ)78. 


10. a/(a+ δὴ) +1 = (2a-+ b)/(a-+ δ). 
11. Prove (3) by using the distributive law and a/b = a(1/0). 


12. Show that ἢ- 


(w—a) =p—w-—ap/wta. (History of Kco- 


nomic Analysis, by J. A. Schumpeter, p. 467) 

13. Complete the following: The argument following (1) shows that “Ὁ or 
any term synonymous to it is not a _____ for “δ᾽ in the formula “a/b.” 
In fact “a/0” is not a formula, because _______ . 

14. “The formula given earlier for the coefficient of determination is: 
1 — (Error variance)/(Total variance).... It follows that an equivalent ex- 
pression .. . is as follows: (Total variance — Error variance)/(Total variance). ἢ 
(A Primer of Political Statistics, by V. O. Key, p. 115) Derive the second formula 
from the first. 

15. “Assign to 01 the value Ri/(Ri-+ Re+ Κα τ Ha), to Og the value 
Ro/(Ri + Re+ Κατ Ra), to 03 the value R3/(Ri + Ro+ R3-+ Ra), and 
to 04 the value R4/(Ri + Re+ R3-+ Ra). The sum of these values should 
be equal to 1.” (The Design of Social Research, by R. L. Ackoff, p. 25) Justify 
this statement. 


16. Show that p/(l — (1 — p))? = 1/p and - 


f= ΞΡ το 
(Readings in Learning, by L. M. Stolurow, p. 53) 

17. Show that (a + a’)/(b-+ δ) = a(1 + a’/a)/b(1 + δ΄ ,8). (A Geometry of 
International Trade, by J. E. Meade, p. 13) 

18. Solve for 7:1/7 = 1/S+.1/E. (Manual of Astronomy, by R. W. Shaw 
and §. L. Boothroyd, p. 98) 
#19. In music an interval between two tones is measured by the ratio of the 
frequencies as follows: octave, 2/1; fifth, 3/2; fourth, 4/3; major third, 5/4; 
minor third, 6/5; major sixth, 5/3; minor sixth, 8/5; second, 9/8. To find the 
sum of two intervals we multiply the ratios; to find the difference we divide. 
(a) The sum of 2 fifths is called a ninth. What is its ratio? (b) Show that a 
fifth less a fourth is a second. (c) Show that the difference between 12 fifths 
and 7 octaves is 531441/524288. (d) Show that the major third taken three 
times differs from the octave by 125/128. 
*20. The number 7 may be approximated by the equation 


4/r = 1+ 32 
2+ —3 
2-+ a 

Solve for πὶ and express as a decimal. 

#21. From (19), derive formulas (16), (18), (20), (26), and (28) of Section 1-9. 
+22. What are the analogs for subtraction of (2) and (8) through (11)? 
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ANSWERS TO EXERCISES 


(a) (1)(6:a@, ¢:1) is [a/a = 1] = [@ = a-1], and the right member is true 
by (1-9-25). (0) [(—-8)/(—2) = 4] = [—8 = (—2)4]. (ec) (a:1, b:1/2, 0:2). 
(d) (2)(a:1, 6:2, ¢:3), (8)(a:3, b:5, 4:6), (2)(a:4, b:3, 5:2). (6) Each right 
member was obtained by a replacement in the previous one. (f) 3/2. 
(g) 17/15. (8) 49/30. (i) - Use distributive law and (1-9-17). (j) a/b + 
c/d = ad/bd + cb/bd =.... (k) 9/10. (il) 1/2. (m) 9/8. (n) 3. 
(0) 1. (Ὁ) 2/3 by (122. (q) —7/2 by (13). (ἡ —2 by (14). (8) 0 by 
(1-9-32). (t) 18. (u) 1. (v) —1. (w) (2 - a — b)/2(a — δ). 
(x) (8a — 4)/4(z?+ 1). (y) =. 


ANSWERS TO PROBLEMS 


I, 2, 3. (a:b, 6:1). 4. (2)(a:1/a — 1/0, b:1/a + 170, c:ab). 6 through 10. 
See note in Problem 17 in Section 1-12. 11. a/d+ b/d = (1/d)a+ (1/d)b = 


(1/d)(a+b) = (a+ d)/d. 12,27 


(pw — pa — w*-+ wa)/w = p -- ὦ — pa/w+a. 18. significant substitute: 
it does not become a constant for any substitution for a. 


(w — a) = (p — w)(w — a)/w = 


I-15 Decimals. Which is larger, 11/12 or 23/25? The easiest way to 
find out is to express both numbers in decimal form. By long division we 
find 11/12 = 0.91666... and 23/25 = 0.92. (The dots in 0.91666... 
are used to suggest that the 6 is repeated indefinitely.) Evidently 23/25 
> 11/12. 


(a) Check the above by carrying through the long division. (b) Which 
is larger, 5/7 or 44/61? 


For purposes of such comparisons and for convenience in calculation, 
real numbers are often expressed by decimals. As we see from the above 
example, it is not always possible to write a terminating decimal, such as 
0.92, to represent a number. An infinite decimal, such as 0.91666... , is 
often required. Since a terminating decimal may be thought of as an 
infinite decimal with only zeros after a certain point, we may think of all 
decimals as infinite decimals. More precisely we think of a decimal as an 
expression of the form 


(1) N.dydodsd4 ee He Gy aia or —N.d,ded3 Bad dy sey 


where Ν᾽ is a non-negative integer and the d’s are digits. We call d, the 
digit in the nth decimal place. The dots suggest the endless sequence of 
digits, some or all of which may be zero. 


(c) What is a digit? (d) What is the digit in the fourth decimal place in 
— 27.391204? 
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When the decimal ends in zeros after a certain point, we call it termz- 
nating. When after a certain point a decimal consists of the repetition of a 
digit or group of digits, we call the decimal repeating. We indicate this by 
writing dots over the repeated group of digits. For example, we write 
11/12 = 0.916. A terminating decimal is also a repeating decimal, since 
after a certain point it consists of a repetition of the digit 0. 

Any rational number can be represented by a repeating decimal, which 
can be found by long ἐὐΐπειοηι For eras 2 = 2. 0, —31/ 10 ; = —3.10, 


ee 9 8 »ν 


1.7142857 ... 
7*/12.0000000. . . 
7 
580 
49 
~ 10 
7 
(2) 
20 
14 
60 
56 
“40 
35 
“50 


The calculation suggests the proof of our claim that any rational num- 
ber can be represented by a repeating decimal. Suppose we try to find 
m/n by long division, where m and n are integers. Since the decimal 
representing m consists of zeros after the decimal point, there comes a 
time in the division after which we are “bringing down” only zeros. If 
after this point the same remainder turns up twice, as 5 does in (2), the 
group of digits since its last appearance will be repeated from there on. 
But when dividing by n there are at most n different remainders. Hence 
a remainder is bound to reappear in at most 7 steps. 


(6) If zero appears as a remainder, does this spoil the argument? (ἢ) Argue 
that in the repeating decimal representing a rational number 7/s in lowest terms, 
there are at most 8 digits in the smallest repeating cycle of digits. (6) Find 
the decimal representation of 15/7, (8) 4/9, (i) 18/18. 


We have seen that every rational number can be represented by a re- 
peating decimal. Is the converse true? Is every repeating decimal a 
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rational number? Consider, for example, x = 3.91 = 3.9191.... Since 
multiplication by 100 may be indicated by shifting the decimal point 
two places, we have 1002 = 391.91... = 391.91. Then 1002 — x = 


391.91 — 3.91 = 388. Hence 992 = 388 and x = 388/99. The cal- 
culation may be written as follows: 


100z = 391.919191... 
(3) —x = 3.919191... 
99% = 388.000000... 

x = 388/99 


This example suggests the following rule: To find a ratio of natural num- 
bers equal to a repeating decimal x, multiply by 10”, where n is the number 
of digits in the repeated group of digits, subtract x, and divide by 10” — 1. 
(If this gives the ratio of two terminating decimals, multiply numerator 
and denominator by some power of 10 to get a ratio of integers. ) 


(j) Prove that (10% — x)/(10" = 1) = 2. Use the rule to find the ratio of 
integers equal to (1) 0.3, (1) 0.35, (m) 2.31, (ὦ) 1.9. 


It is evident that the above process applies to any repeating decimal, 
but the reader may ask whether we are justified in manipulating infinite 
decimals, as we did in (3), as though they were terminating decimals. The 
answer is that it can be justified rigorously in various ways, all of which 
are too time-consuming for our purposes here. Moreover, in each case 
the above calculation can be checked in reversé by long division. 

In Exercise (n) the reader found that 1.9 = 2 = 2.0. Apparently, 
there may be more than one repeating decimal representation of a rational. 
Let us consider a repeating decimal ending in nines, © == 7.0,a2... a9. 
Then x = n.ajde...a; + 0.00... 9, where there are 7 zeros preceding 
the repeated part. But by a calculation similar to (3), 0.00...9 = 
0.00...1, where there are i — 1 zeros preceding the 1. Hence x = 
N.a,d2...(a; + 1)0. Thus we see that any decimal ending in nines is 
equal to the decimal found by increasing by one the last digit preceding 
the nines and changing the repeated nines to zeros. 


(0) Use the procedure of (8) to find 0.9, 0.09, 0.009. 0.0000009, and 32.1859. 


To avoid this double representation of some rationals, we agree to ex- 
clude decimals ending in nines. Accordingly, from now on “repeating deci- 
mal” refers to repeating decimals other than those ending in nines. With 
this understanding, there is just one repeating decimal corresponding to 
each rational and one rational corresponding to each repeating decimal. 

What about irrational numbers? Clearly we must use nonrepeating in- 
finite decimals to represent them. It is easy to give examples of non- 
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repeating infinite decimals. For example, 0.1010010001..., where it is 
understood that after each “1” we put one more “0” than previously. The 
decimal expansion of ν 2, 7, or any other irrational number does not re- 
peat. Nevertheless, the digit in every decimal place is definitely deter- 
mined by the number. 

To see why this is so, we recall from Section 1-2 that a real number is 
one that represents a point on an axis. Suppose that we have a real num- 
ber x, which we assume positive to simplify the argument. We shall show 
that regardless of the position of x, there exists an infinite decimal repre- 
senting it. If x is an integer, we have its decimal equivalent immediately. 
If not, it lies between two integers of which the smaller gives us the in- 
tegral part of our decimal. Divide the segment between these two integers 
into tenths. Again if the point lies on one of these points of division, we 
have a terminating decimal. If not, it lies in the interval between two of 
them, and the smaller gives us the first place in our decimal. Now we 
divide this last interval into ten equal parts (of length 0.01) and repeat the 
process. Evidently, if the position of the point is exactly known (and this 
is what we mean by saying that we have a definite real number given), 
we can determine the digit in any decimal place in its decimal expansion, 
1.6., the decimal is uniquely determined. Hence, to every real number 
there corresponds a unique infinite decimal. 


(p) Why is there no difficulty here with decimals ending in 9’s? 


Conversely, suppose we are given an infinite decimal, N.didad3... | ee 
To find the corresponding point we first go to the integral point Δ΄, then 
d,/10, then d2/100, and so on as indicated in Section 1-2, except that now 
we never come to the last step unless the decimal happens to end in zeros. 
Nevertheless, it seems plausible that there is a unique corresponding point. 
For when we reach the point N.d;, we know that the corresponding point 
(provided it exists) lies between N.d; and N.(dy + 1). When we reach 
N.d,d2, we know that it lies between this and N.d,(dz2+ 1). At the next 
step we know our point lies between V.ddeds and N.d,do(d3 + 1), and 
at the nth step between N.dided3 ... dy and N.dided3...(dn +1). We 
have suggested these intervals in Fig. 1-25. Actually each of the seg- 
ments is 1/10 the length of the previous, but we have distorted their size 
in the sketch. 

We see that the process generates a sequence of segments each within 
the previous one and each 1/10 the length of the previous one. This se- 
quence is without end. (If the decimal ends in zeros, after a certain point 
all segments have their left ends coincident.) Since their lengths approach 
zero, it seems plausible that there is one and only one point common to 
them all. Certainly there can be no more than one. For if there were two 
different points, however close, there would come a segment too small to 
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N Ν. De Ν. (ἄν + 1) Vea 
N.didg “N. dy(dp + 1) 


FigurRE 1-25 


contain them both. If we assume that an infinite sequence of intervals, 
each contained in the previous and approaching zero in length, contains 
ἃ common point, then this argument shows that each infinite decimal 
determines a unique point, i.e., a unique real number. 


(q) To illustrate the above argument consider the decimal expansion of 7, 
3.14159.... Complete the following: From the first two digits we know that 
m lies between 3.1 and ______. Using two decimal places, we know that it 
lies between 3.14 θὰ ο. Using three, between _.______ and_ 3.142. 
Using four, between ____s and ___ i Using five, between 
8 8. ὁ, The length of this last interval is . (r) We have 
excluded decimals ending in 9’s. However, if we applied the process of the 
previous paragraph to such a decimal, what would be the nature of the in- 
tervals and the corresponding point? 


The above arguments are intended to make more reasonable the identi- 
fication of real numbers and points that we mentioned in Section 1-2. 
We shall henceforth identify infinite decimals with real numbers. Then we 
may sum up the above discussion by saying that there is a one-to-one cor- 
respondence between real numbers and points on an axis. 


PROBLEMS 

Arrange in order of increasing size. 

1. 8/7, 28/25. 2. 3.1416, π, 22/7. 

3. V3I, 5.61, 5.5678. 4, 23/11, v/9, 2.081. 
Express as ratios of integers. 

5. 35.27. _ 6. 35.027, 

7. 2.142857. 8. 8.113. 
Find the decimal expansion. 

9. 1/11. 10. 3/11. 11. 23/51. 12. 1/17. 


* 13. In the arguments showing the one-to-one correspondence between points 
and decimals, we assumed positive decimals. Give an argument for negative 
decimals. 
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#14. The following ratios are of interest in the theory of music: 81/80, 125/128, 
531441/524288. Range them in order of size. (1 ntroduction to Musicology, by 
Glen Haydon, p. 33) 

15. For another elementary discussion of decimals and real numbers, see T’he 
Theory of Numbers, by B. W. Jones, pp. 31 ff. - 


ANSWERS TO EXERCISES 


(b) 44/61. (ὁ) One of the symbols 0, 1, 2, 3, 4, 9, 6, 7, 8, 9, (d) 2. 
(e) No, since zero is then repeated. (6) 2.142857. — (h) 0.4. (i) 1.153846. 
(j) 10% — 2 = 2(10"— 1. (k) 1/8. (ὃ 35/99. (m) 104/45. (n) 2. 
(o) 1, 0.1, 0.01, 0.000001, 32.186. (Ὁ) The procedure would lead us to a 
terminating decimal equivalent to one ending in nines. (q) 3.2; 3.15; 3.141; 
3.1415, 3.1416; 3.14159, 3.14160; 0.00001. (Ὁ) After a certain interval, all 


intervals would have the same right endpoint. 


ANSWERS TO PROBLEMS 


1, 28/25, 8/7. 3. 5.8678, 31, 5.61. 5. Check by long division. 9. Check 
by reverse procedure. 


1-16 Axioms for the real numbers. If we wish to convince a person 
of the correctness of a statement in such a way that our success does not 
depend on his weaknesses or our cleverness and so that our argument would 
be equally convincing to others, we must adopt a method of proof that 
meets with general agreement. Whatever this method is, it must certainly 
involve making statements, since it is hard to conceive of convincing any- 
one of anything without communicating with him. So we must carry on 
the proof by means of statements. But suppose that the listener objects 
to some of our statements. Then we must convince him of their validity 
before we can continue. We can do this only by making other statements. 
Again, if our listener does not agree with these, we should have to try to 
justify them—of course by making still other statements. Evidently we 
cannot convince anyone of anything unless we can get him to agree to 
something! Apparently we cannot prove all statements any more than we 
could define all terms. We must begin by assuming some statements 
without proof. 

In mathematics, sentences that are assumed to be laws without proof 
are called axioms. Synonyms of “axiom” are “postulate,” “basic as- 
sumption,” and (outside mathematics) “hypothesis.” Sentences that are 
proved to be laws are called theorems. The previous paragraph is intended 
to convince the reader that if we wish to prove any theorems we must be- 
gin by assuming some axioms. 

In previous sections we have proved some laws of real numbers by ap- 
plying the rules of replacement and substitution to other laws. To avoid 
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confusion and possible circularity, it is best to pick out a few laws as 
axioms and derive all others from them, just as is customarily done in 
high-school plane geometry. In the following axioms we may think of 
real numbers as infinite decimals, points on an axis, vectors on a line, or in 
any other way not inconsistent with the axioms. 


If a and ὃ are any two real numbers, there exists a unique 


(1) Ax. real number a + ὃ, called the sum of a and ὃ, and a unique 
real number ἃ " ὃ, called the product of a and ὃ. 

(2) Ax. atb=b-+a. 
(Commutative laws) 

(3) Ax. a-b = b-a. 

(4) Ax. a+(b+ec)=(a+b)4+e. a 
(Associative laws) 

(5) Ax. a:(b-c) = (a-b)<c. 

(6) Ax. a:-(b+c) = (α - δ) + (a-c). (Distributive law) 

(7) Ax. There 18 a real number 0 such that a + 0 = a. 

(8) Ax. There is a real number 1 such thata-1 = a. 

(9) Ax If ais a real number, there exists a unique real number 

: —a, called the negative of a, such that a + (—a) = 0. 
If α 18 a real number not equal to zero, there exists a 
(10) Ax. unique real number α΄, called the reciprocal of a, such 


thata-a—! = 1. 


Note that there are no definitions here of the terms or operations in- 
volved. The words and other symbols that appear are all undefined. 


(a) List the undefined constants that appear in these axioms. (b) List 
the undefined formulas. (c) Prove that (—a) + α = Qanda-la = 1. 


We now wish to prove that all the identities of algebra stated in previous 
sections can be derived from these axioms. Since some of these identities 
have already been derived from others, it will be sufficient to tie in these 
axioms with our previous derivations. 

First we observe the importance of (1), obvious as it seems. Without 
it we could not even write a + ὃ = a+ b, for if the sum were not unique 
the two members might be different! From it we know that if a = b and 
ὁ = d, then a +c = ὃ -ἰ ἃ and ac = bd. This justifies the rules given 
in Section 1-6 for manipulating equations. Axiom (1) also justifies the 
manipulations of infinite decimals in Section 1-15. 

Next we derive the identities in Section 1-9 having to do with addition 
and multiplication. We note that (1-9-22) is just (7), and (1-9-23) 
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follows from (7) and (2). Similarly, (1-9-24) and (1-9-25) follow from 
(8) and (3). (1-9-26) is just (9), but in order to get (1-9-27), we need to 
show that a~! = 1/a. For this we need the definition of division (1-14-1). 
According to this definition, [1/a = a~*] = [1 = a(a~*)]. But the right 
member is true by (10). Hence 1/a = a~'. Now (1-9-27) is seen to be 
the same as (10). 


(4) Why did we not use 1/a in the axioms instead of a~!? 


To prove (1-9-30) we write 


(11) a:‘0=a-0+0 (a = a+ 0)(a:a - 0) 

(12) = a:0+aa-+ (—aa) (a + (—a) = 0)(a:aa) 

(13) = a(0+ a) + (—aa) (a(b + c) = ab + ac)(b:0, c:a) 
(14) = aa + (—aa) (0+ a= a) 

(15) = 0 (a + (—a) = 0)(a:aa). 


Here we have given reasons in full, showing the laws and substitutions. 
Below we shall usually omit part or all of the reasons. 


(6) Prove (1-9-31) immediately from the above and (3). (ἢ Prove (1-9-31) 
as in (11) through (15), using the right distributive law, which was derived in 
Section 1-12 solely from our axioms. 


Now we can easily derive (1-9-33) through (1-9-37). First we prove 
(1-9-34) by 


(16) (—la = (--1)ὴα + 0 

(17) = \(—De+o-+(=@) 
(18) = ((—1) + 1) + (—a) 
(19) = 0-a-+ (—a) 

(20) = 0+ (—@) 

(21) = --α. 


() Fill in complete reasons for the steps in (16) through (21). 


We now have, similarly, (—1)(—1) = (—1)(—1) +. 0 = (—1)(-1) + 
(--1 +1 = (-2(-1) + (121 +1 = (-)((--Ἠ1} +) +1 
(—1)-0+1=0+1=1. With (—1)(—1) = 1 we can quickly 
derive the other identities concerned with negatives. For example, 
(—a)(—b) = (—l)a(—1)b = (—1)(—l)hab = 1-ab = ab. Also, 
(—a)b = (—1l)ab = —ab. 


(h) Similarly prove (1-9-33) and (1-9-36). 
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As shown in Section 1-14 we can derive the identities concerning frac- 
tions from the definition (1-14-1). However, we need still to show that 
(1-14-1) does define a unique number a/b for all real a and b, provided 
δ ~ 0. First we see that it does determine at least one such real number. 
For if 6 # 0, we have b(ab~*) = abb~! = a-1 = a by rearrangement, 
(10), and (8). Hence ab is a value of ὁ that satisfies the right member of 
(1-14-1) and a/b = ab~!. Now suppose that there were two values ὁ 
and c’ that satisfied the right member of (1-14-1), that is, a = be and 
a = bc’. By the rule of replacement be = be’. Now we multiply both 
members by b~’, which is possible if b ¥ 0, to get b~4be = b~ be’. But 
δ = 1, so this becomes ὁ = ο΄, and we have proved that the quotient 
c is uniquely determined. 


(i) Similarly show that (1-14-19) defines a unique number 6. 


We have now established all the identities in Section 1-9. Since those 
appearing in later sections were proved from them with the addition of 
the distributive law in Section 1-12 and the definitions of Section 1-14, 
we have completed our task of showing that the axioms are sufficient to 
derive the identities of previous sections. Actually, for the purposes of 
elementary algebra we need no further axioms about the real numbers. 


PROBLEMS 


1. We adopted a special symbol, a—!, for the reciprocal for reasons given in 
the answer to Exercise (d). Why did we not have to adopt a new symbol for the 
negative? 

2. How have we used (4) and (5)? 

3. Write a detailed proof of (—1)(—1) = 1 along the lines indicated. 

4. Similarly, write a detailed proof with reasons for (—a)(—b) = ab. 

*5. Suggest an alternative system of axioms that could serve in place of (1) 
through (10). 

*6. Show that 0 is the only real number with the property (7). (The number 0 
is sometimes called the identity element for addition.) 

*7. Similarly, show that 1 is the only real number with the property (8). 
(The number 1 is sometimes called the identity element for multiplication.) 
*8. It is not true that any definition will yield a uniquely defined object. To 
illustrate this consider the following definition: (a/b) 6 (c/d) =a (a+ c)/(o+ d), 
where @ is a new operation consisting in “adding” rational numbers by adding 
numerators and denominators. Show that this operation does not yield a 
unique result by showing that if a/b = a’/b’ and c/d = c’/d’, then it does 
not follow that (a/b) θ (c/d) = (a’/b’) θ (c'/d’)! 

*9. Suppose in (1) through (10) we inserted “rational” for “real.” Would all 
the axioms hold? 

*10. Answer the same question for “integer” in place of “real number.” 
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ANSWERS TO EXERCISES 


(a) “0,” ay (b) “a+ δ," “a+b,” er and “a-!.” (c) (9), (2); and 
(10), (5). (4) If we had used 1/a we would be in the position of defining a 
previously introduced undefined term in our definition of division (1-14-1), 
contrary to the rules of Section 1-13. (ὃ 0-a =0-a+0=0-a + aa+ 
(--αα) = (0+ a)a+ (—aa) = αα -ἰ- (—aa) = 0. (g) (α -Ἐ 0 = αλία:(---1}α); 
at (—a)=0; (+ c)a = ba+ δο)(δ:---Ἰ, 6:1); ((—a) + α = O)(a:1); 
(0. α = 0;0 - a = a)(a:—a). 

(h) —(—a) = (—1)(-l)a = 1:-α =a; a(—b) = a(—1)b = (—1)ab = 
—ab. (i) a+ (—b) satisfies the definition by substitution in the right mem- 
ber of (1- ae. rearrangement, (7), and (9). Ifa=b+canda=b+¢, 
then b-+ c = 8-ἃ ο΄. Adding (—b) to both members, we have c = c’. 


ANSWERS TO PROBLEMS 


1. —a is already a symbol not previously appearing nor defined in (1~14-19), 
since the latter defines only the formula a — ὃ and not —bd standing alone. The 
formula 0 — a for the negative would have the same disadvantage in the 
axioms as would 1/a for the reciprocal. 2. In rearrangements. 5. For example, 
0-+ a = ain place of (7). 


*1-17 Check list for reading mathematics. In this chapter we have 
been at some pains to explain everything in detail. Usually in mathe- 
matical discourse this is not possible. As a rule, explanations are incom- 
plete. Moreover, important parts of calculations and manipulations may 
be omitted. For these reasons the reader of mathematical discourse must 
take an active part. He must use his mind (and also plenty of paper) 
to fill in the more or less bare outline that he reads. The following check 
list suggests some activities that may be helpful. 

1. Give several interpretations in words. 

2. If possible give a geometric interpretation, visualize, and make a 
drawing. 

3. Experiment by substituting constants and formulas for variables 
that appear, and interpret the results. 

4. Decide on the values and significant substitutes for any variables 
that appear. 

5. Think of ΠΤ if you can. 

6. Construct informal arguments, making full use of the meaning of 
terms, to establish the plausibility of axioms, definitions, and theorems. 

7. Make slight variations in laws and see whether the results are still 
laws. This will help you see why the laws are stated in a particular way. 

8. Check every step in proofs, actually writing out every reason in full 
and performing the indicated substitutions. 

9. Rewrite incomplete proofs by inserting omitted steps. 
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10. Rewrite complete proofs in briefer form. 

11. Note the key ideas involved in proofs. 

12. Where no proof is given, try to construct one if time permits. 

Note that the check list concentrates on experimental activities and 
intuitive thinking. It is such activities that lead to an appreciation of 
ideas and lay the basis for precise logical formulations and proof. As 
George Sarton writes in his The Study of the History of Mathematics (p. 19), 
“The ways of discovery must necessarily be very different from the short- 
est way, indirect and circuitous, with many windings and retreats. It’s 
only at a later stage of knowledge, when a new domain has been suf- 
ficiently explored, that it becomes possible to reconstruct the whole theory 
on a logical basis, and show how it might have been discovered by an 
omniscient being, that is, if there had been no need for discovering it!” 
Mathematics is usually presented in this final logical form, but the learner 
must energetically experiment and think in order to understand its 
significance. 

It is no accident that the construction of a proof is listed last. Proof 
should be the last step. As Polya writes in How to Solve It, “If you have 
to prove a theorem, do not rush. First of all, understand fully what it 
means. Then check the theorem; it could be false. Examine its conse- 
quences, verify as many particular instances as are needed to convince 
yourself of its truth. When you have satisfied yourself that the theorem 
is true, you start proving it.” 


(a) Reread Note to the Student in the prefatory material. 


CHAPTER 2 
ELEMENTARY LOGIC 


2-1 Introduction. Logic may be described roughly as the theory of sys- 
tematic reasoning. Symbolic logic is the formal theory of logic. There are 
many kinds of logic, but we shall consider only the logic that is most 
commonly used in mathematics and other sciences. 

Symbolic logic has important applications in science and industry. In 
the New York Times of November 25, 1956, a well-known producer of 
electric products advertised for “men with ideas” to work in the field of 
electronics. The advertisement called for students of mathematics who 
had done “creative and original work in all fields of mathematics,” and 
who had “an interest in the theory of numbers, theory of groups, Boolean 
algebra and symbolic logic....” In the following chapters the reader will 
get some inkling of why a great corporation is interested in such matters. 

We introduce symbolic logic now for three reasons: (1) We can utilize 
its symbols and laws to simplify later work. (2) The axioms and proofs of 
elementary symbolic logic are simple and serve to illustrate the nature of 
a formal mathematical theory. (3) The laws and methods of logic will be 
useful to the reader in all his thinking in mathematics and other areas. 

The purposes of this chapter are (1) to familiarize the student with the 
most important concepts and notations of symbolic logic, (2) to supply 
him with logical laws of wide applicability, (3) to develop his skill in read- 
ing formal mathematics, and (4) to apply logic to the algebra of real 
numbers. 


2-2 Some simple logical formulas. Logical formulas are sentences 
whose variables stand for propositions. The purpose of this section is to 
familiarize the student with the following logical formulas. 


Logical formula Informal verbal synonym 
(1) ~p It is false that p. 
(2) pA q p and ᾳ 
(3) pV q p and/or q 
(4) pV q p or else g 


We do not take the space constantly to suggest that the student carry out the 
operations listed in Section 1-17. Occasionally we ask questions, but the reader 
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should still refer to the check list and work over the material on his own. Item 
3 on the list indicates that sentences should be substituted for p and q in (1) 
through (4). (8) Carry out (p: I shall buy a car, 4: I shall sell my old car) in 
(1) through (4). (Ὁ) What is the difference between (3) and (4)? 


Negation. We call ~p the negation of p. It is the sentence that denies p. 
We read ~p as “It is false that p,” “p is false,” or “not-p.” The negation 
of a sentence is the sentence that is false when the original is true, and 
true when the original is false. We indicate this in table (5), in which 1 
means truth and 0 means falsity. 


] 
(5) | ——|—}— 


When a proposition is true we say that it has the truth value truth 
(represented here by 1), and when it is false we say that its truth value 
is falsity (represented here by 0). Then table (5) indicates in its first 
row the possible truth values of p and in the second row the corresponding 
truth values of ~p. It shows that p and ~p always have opposite truth 
values. 

To find the negation of a sentence, we must find a sentence that con- 
forms to both columns of (5). For example, “He is a good hunter” and 
“He is not a good hunter” are each the negation of the other, because 
if one is true the other is false. However, “He is a bad hunter” is not the 
negation of “He is a good hunter,” because both might be false (if he is 
not a hunter at all!). 


(c) Why, in view of the meaning we attach to “proposition,” does (5) repre- 
sent all possibilities? (d) Why is “It is white” not the negation of “It is 
black”? (6) Express ~p in several ways if p = (z is happy). (ὃ Why is 
“~3” nonsense? (6) How do we usually express ~(a = δ) 


Conjunction. We call p A q the conjunction of pand q. It is the sentence 
that asserts both p and ᾳ. We read it as “p and g,” “Both p and gq are true,” 
or “p is true and q is true.” The conjunction of p and q is true when both p 
and g have truth values 1, and it is false otherwise. This is indicated in 
table (6), which gives the possible combinations of truth values of p and q 
in the first two rows and the corresponding truth values of p A q in the 
last row. | 


(6) 
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(8) True or false? (2+ 2 = 4) A (“3” is a name of 8. (i) What is the 
scope of A in (h)? (j) Why is (2 = 4/2) A (0 = 1) false? (k) Why 
is “2 A 5” nonsense? 


Disjunction, the inclusive “or.” The word “or” in English has two dis- 
tinct meanings. The exclusive “or” means “or else,” as illustrated by 
“Hither the Repubocrats will win or (else) I’ll eat my hat!” The inclusive 
“or” means “and/or,” as illustrated by “I intend to study French or (and/ 
or) German.” The inclusive “or” means one or both of the two possibili- 
ties; the exclusive “or” means one but not both. It turns out that the 
inclusive “or” is more frequently used in mathematics, and we adopt for 
it the symbol V. 

We call p V gq the disjunction of p and q. It is a sentence that asserts 
that at least one of p and q is true. We read it as “p or q,” “p and/or q,” 
or “p or else g or else p and q,” according to convenience. Table (7) shows 
how the truth value of p V gq depends on the truth values of p and of gq. 


(7) 


(1) True or false? (2 = 10/5) V (2 = 5). (m) True or false? (George 
_ Washington was our first president) V (John Hancock signed The Declaration 
of Independence). (n) Why is “George V Mary” nonsense? 


Often we wish to state that one number, a, is less than or equal to an- 
other, b. We write a < ὃ, defined as follows. 


(8) Def. [fa < b] = (a « δ) V (a = ὃ). 
(0) Why is 2 < 3? (p) Why is 2 < 2? (q) Define a > ὃ 


Exclusive disjunction. The exclusive “or” is not used as much as V, but 
it is occasionally convenient, and for this reason we include it here. We 
call p V gq the exclusive disjunction of p and q. It is the sentence claiming 
that one and only one of p and q is true. We read it “p or else q,” “One 
and only one of the following is true: p or 4," or “p or g but not both.” 
Table (9) indicates the way in which the truth value of p V q depends on 
the truth values of p and of gq. 


2-2] | SOME SIMPLE LOGICAL FORMULAS 73 


(9) 


(r) Why is “up V down” nonsense? (8) Under what conditions is pv q 
true but p V q false? (Ὁ) Can p V gq be true and pV ᾳ false for the same 
truth values of p and α (ἃ) Compare the meaning of “I will study mathe- 
matics V I will study physics” with the same sentence with (V:V). 


“a 


Definitions. The logical symbols introduced above are not independent. 
We can take some of them as undefined and use these to define the others. 
For example, we note from (7) and (6) that p V ¢ is false only when p and 
q are both false, that is, when and only when ~p A ~q 1s true. Hence 
p V q is true when and only when ~p A ~gq is false, that is, when 
~(~p A τὸ is true. This suggests the definition 


(10) Def. lp V gq) = ~(~p A ~9Q). 
Similarly (9) and (7) suggest 
(11) Def. PY I=PVQAW~DA ®. 


(v) In (10) and (11) what symbols are taken as undefined? (w) Justify (11). 


PROBLEMS 


Translate 1 through 6 into words. 


1, ~(2 > 3). 2.(3 < 4) A 4 < 5). 
3. (t = 3) V (ὦ = 4). 4. (zr = 3) V (« < 0). 
5. (« < 0) V ὦ = 0) V @ > 0). 6. ~[¢ <0 A xz > Oh. 


Translate 7 through 14 into symbols, using ~, V, A, V. 
7. 3 does not satisfy x? = 10. 
8. 2 1s less than 10, and 2 is a digit. 
9. (1-13-16). 
10. 27 = 2-2, but 3? ~ 2-3. 
11. One and only one of the following holds: a < b,a = b,a > ὃ. 
12. [8 = 5, yet |—5| γέ —5. | 
13. 3 < 4, however —4 < - 3, 
14. 0 is neither positive nor negative. 


Suppose that p is true and q is false. Determine the truth value of 15 
through 22. 
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15. ~q. 16.~p V q. 
17. ~(p V ~@). 18. pA ~@. 
19.~p V ~@. 20. ~(p V 4q). 
21. pV q. 22. ~p V q. 
23. Let D be the truth value of p. Then show from the truth tables that 
(12) ~p = 1 —?p, 
(13) PNG@=P'G 
(14) pVq=DPrai—-D'G 
(15) p¥q= (|p — YW. 


* 24, Suppose we take ~ and V as undefined. Define A. 
#25. Define V and A in terms of ~ and V. 


ANSWERS TO EXERCISES 


(a) (1) It is false that I shall buy a car. I shall not buy a car. (2) I shall 
buy a car and sell my old car. (3) I shall buy a car and/or sell my old car. 
(4) I shall buy a car or else I shall sell my old car. (Ὁ) The first means either 
or both, the second one but not both. (6) A proposition is either true or false 
but not both, by definition of “proposition.” (4) Both might be false. (6) 2 
is not happy; it is false that z is happy; not-(z is happy); (ὦ is happy) is false. 
(f) Because “3” is not a sentence and hence not a significant substitute in (1). 
(g) a ~ ὁ. 

(_h) 1. (Gi) “2+2 = 4” on left, “3’ isa name of 3” onright (2) See (6). 
(k) “2” and “5” arenotsentences. (1) 1. (m) 1. (n) “George” and “Mary” 
are not sentences. (0) 2< 3. (p)2=2. (q) [a > ὃ] = ~la < ὃ] or 
la > Ὁ] = (a > b) V (a = ὃ). (xr) “Up” and “down” are not sentences. 
(s) When p and q are both true. (t) No. (ἃ) First means either or both, 
second either but not both. (v) ~ and ΔΛ. (w) It says that p V q means 
that p is true or g is true but not both, which is what (9) Indicates. 


ANSWERS TO PROBLEMS 


1. 2 is not greater than 3. 3. z is 3 or 4. 5. One and only one of the following 
holds: x <0, 2 =0,2>0. 7. ~(8? = 10). 9 ἵν = Vz] = fy = aA 
y > 0]. 12. [5] = 5A |—-5| γέ —5. 15. 1. 17. 0. 19. 1. 21.1. 28. From 
truth tables by considering all cases; or show (12) and (13), then use (10) and 


(11). 


2-3 Implication. Sentences of the form “If p, then q’’ are very common 
in scientific discourse. The if-then idea is expressed in many ways, of 
which the following are synonymous examples. 


(1) If p, then gq. 
(1’) q if p. 
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(2) p implies gq. 

(27) qg is implied by p. 

(3) p only if g. 

(4) Hyp: p, Con: gq. 

(5) p is a sufficient condition that q. 
(6) q is a necessary condition that p. 


In everyday discourse such expressions are used with various meanings 
and connotations. In scientific discourse these sentences are synonymous 
and have a precise technical meaning. In mathematics a special symbol 
is usually used, the most common being an arrow pointing from the 
hypothesis to the conclusion. We shall adopt this notation and write 


(7) pq 


as a synonym for sentences (1) through (6). 

The meaning we attach to p — ᾳ is indicated by the formal definition 
(12) below. To prepare for the definition we indicate the significant 
substitutes in (7), the conditions under which it is true or false, and the 
nature of the information that it may convey. 


The expression “p — q” is a propositional formula in which 
(8) significant substitutes for the variables are sentences and only 
sentences. 


(a) What does p — g become if for the variables we substitute statements? 
(Ὁ) numerals? (6) names of people? (d) Why is every value of p — ᾳ either 
true or false? 


A statement of the form p — ᾳ is considered true under any one of the 
following three conditions: 
p is true, and q is true. 
(9) p is false, and q is true. 
p is false, and q is false. 


It is considered false only in the following case: 
(10) p is true, and q is false. 


A statement of the form p — 4 makes the claim that one of the three 
possibilities listed in (9) is the case, but ἐξ makes no other claim. In particu- 
lar, it says nothing as to whether p is true or false, as to whether g is true 
or false, as to the meanings of p and gq, or as to the relation between these 
meanings. 
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We may summarize the above in terms of a table for p — 4. 


(11) 


Which of the following are true? (6) (2 = 2) -" (ὃ = 8), (ἢ (2 = 3) > 
(4<1), @) @=3)~-4>), (ἡ) ὦ = 2) -» ὦ = 8). | 


We note from (11) that p — q is false just in the one case when Ὁ is 
true and q false. In other words, Ὁ — q is false just when p A ~q 15 
false, that is, when and only when ~(p A ~g@) 15 true. This suggests 


(12) Def. (prog=~Dpa --φ. 


We call p — q the conditional of p and q: The special technical meaning 
assigned to p — q [and to the synonymous expressions (1) through (6)] 
by (12) may not seem entirely natural to the reader. He may think that 
we are not following the criteria of (1-13-9). It turns out that this idea 
of implication is entirely satisfactory for scientific purposes, is more in 
keeping with ordinary usage than first appears, and is more convenient 
than any alternative yet proposed. However, other kinds of implication 
are considered by logicians, and the one defined by (12) is called material 
implication to distinguish it from the rest. 


For each of the following, first decide whether it is true or false on the basis 
of the everyday meaning, then decide the same question on the basis of (11): 
(i) If the ocean is mostly water, then it contains about twice as many hydrogen 
~ atoms as oxygen atoms. (Note: We assume that the formula for water is H20, 
1.6., each molecule of water contains 2 hydrogen atoms and one oxygen atom.) 
(j) If the ocean is entirely grade-A milk, then it contains about twice as many 
atoms of hydrogen as atoms of oxygen. (Κ) If the ocean is entirely grade-A 
milk, then’ocean water is a nourishing beverage. (1) If the ocean is mostly 
water, then ocean water is a nourishing beverage. 


The previous exercises serve as examples to indicate that unless we per- 
mit Ὁ — gq to be true under any one of the three conditions of (9), we shall 
find a marked contradiction between our technical meaning and ordinary 
usage. 

A still further illustration of the advantages of (9) is the way in which 
it facilitates the statement of laws. Consider, for example, the following 
law of elementary algebra (to be proved in Section 2-6): 


(13) (a = b) — (ca = cb). 
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We certainly are inclined to agree that this is a law, 1.6., that if a = ὃ, 
then ca = be. Indeed, this is the law suggested by “If equals be multiplied 
by equals, the results are equal.” 

Now we recall that a law is a sentence that is true for all significant 
values of its variables. Since in (13) any number is a significant value of 
any of the variables, we are free to consider the following cases: 


(14) [2 -- (6—4]—>[3-2=3-(6 — 4] (13)(a:2, b:6 — 4, 6:8), 


(15) (2=3)>5(0-2=0-3) (13)(a:2, b:3, 6:0), 


| 


(146) (2= 3) > (5-2 = 5-83) (13) (a:2, 6:3, ¢:5). 


If (13) is a law, then (14) through (16) must each be true. In (14) both 
hypothesis and conclusion are true, and hence it is true by the first case in 
(9). In (15) the hypothesis is false and the conclusion true, and hence it 
is true by the second case. In (16) the hypothesis and conclusion are both 
false, and hence it is true by the third case in (9). We cannot find an ex- 
ample for which the hypothesis is true and the conclusion false, because 
(13) isa law. We see that unless we agree that p — q is true in all three 
cases in (9), we shall be unable to say that (13) is a law! The same is true 
of many other laws of the form p — 4. 


(m) Discuss the law (α = δ) — (a? = 6?) as we did (13). 


In discussing and using the implication concept and the symbol —, we 
shall make use of the different synonyms listed in (1) through (6). For 
this reason and because these synonyms appear very frequently in scien- 
tific discourse, it is important to be able to translate from any one form 
into any other, and particularly to and from the form p — gq. The essen- 
tial thing is to think of the meaning and to recall that a hypothesis or 
sufficient condition is always at the heel of an arrow, whereas a conclu- 
sion or necessary condition is always at the point of an arrow, as indicated 
in Fig. 2-1. This reflects the fact that one argues from hypotheses (suf- 
ficient conditions) to conclusions (necessary conditions). 


Sufficient Necessary 
condition ceeemeienmnamies ial condition 
FigurRE 2-1 
Translate into each of the forms (1) through (6): (n) (13). (0) The sen- 
tence of Exercise (i). (p) If two triangles are congruent, they are similar. 


We call g — p the converse of p — q. It is the sentence obtained: by 
interchanging hypothesis and conclusion in p—q. We call p © q the 
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biconditional of p and q. It is the sentence that claims that p — q and 
q — p; that is, p — gq and its converse are both true. 


(17) Def. [Ρ «» 4] ΞΞ [(ῷ -- 4) A (ᾳ --Ὁ 5)}. 


T'rom this definition and (11) we easily get table (18): 


(18) 


There are many ways to read p «Ὁ ᾳ. One is “p — gq and conversely,” 
and in this we may replace p — q by any one of the readings indicated in 
(1) through (6). Other forms are “p if and only if q,” “g if and only if p,” 
“p is a necessary and sufficient condition for g,” “p has the same truth 
value as 4," and “p is logically equivalent to g.” The last form refers to 
the equality of the truth values of p and g when p < gq. 

(q) True or false? (2 = 4) <> (3? = 2+ 2), (2 = 2) +> (2 = 38), (24 = 
2-12) <> (2 = 2), (1 = 2) ( = 8. (Ὁ Why is “2<> 2” nonsense? 
(s) Give an example from geometry of a law whose converse is not a law. 


(t) What conclusion can be drawn if (p <> 4) is true and gis true? (u) and pis 
true? (v) and pis false? (w) and ᾳ is false? 


PROBLEMS 


In Problems 1 through 4 cite cases corresponding to each possibility in (9). 


1. If ABC is an equilateral triangle, then ABC is isosceles. (Draw figures 
for each case.) 


(x = y) > (—2 = —y). 
. (AB || A’B’ A AC|| 4΄6) — (ZBAC & ZB’A'C’). 
(x2 = yy?) ~[@=y) V ὦ = —y)]. 


In Problems 5 and 6 both the theorem and its converse are true. Cite cases 
to illustrate the possibilities. Why do you find only two? 


5. (ABC = 90°) — (AB? + BC? = AC?) (The Pythagorean theorem). 
6. (a = ὃ) ~> (atc = b+ ὁ). 


*7. How are the following consistent with our definition of implication? “If 
to do were as easy as to know what were good to do, chapels had been churches 
and poor men’s cottages princes’ palaces.” (Merchant of Venice, Act 1) “If 
all the year were playing holidays, to sport would be as tedious as to work.” 
(King Henry IV, Part 1, Act 2) 
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Translate 8 through 23 into symbolic language. 


8. There can be no great smoke arise, but there must be some fire. 

9. There is no fire without smoke. 
*10. “The existing economic order would inevitably be destroyed through 
lawless plunder if it were not secured by force.” (The Law as a Fact, by 
K. Olivecrona, p. 137) 

11. 3 is a root of z? = 9. 

*12. In a good group you can’t tell who the leader is. 
*13. “In order for a country that imports capital goods to have a high rate 
of investment, it must have a large export industry.” (A Survey of Con- 
temporary Economics, Vol. II, p. 156) 

14. “When an organism is conditioned to respond to one stimulus, it will 
respond in the same way to certain others.” (Principles of Psychology, by 
F. 8. Keller and W. N. Schoenfeld, p. 115) 

*15. “...an adequate command of modern statistical methods is a necessary 
(but not sufficient) condition for preventing the modern economist from produc- 
ing nonsense... .” 

*16. “He is well paid that is well satisfied.” 

* 17. I mean what I say. 

* 18. I say what I mean. 

*19. “The surface of the leaf must be coated so as to prevent evaporation of 
the water that has been so laboriously gathered by the root system.” (H. E. 
Stork, Studies in Plant Life, p. 36) 

*20. You will make 6% provided the dividend is paid. 

* 21. He’ll win as long as he’s better. 

22. “It has been shown... that the equality is necessary if the third and 
fourth marginal conditions are not to be violated.” 

23. It is known that the totally blind are able to detect objects at a distance. 
Although the blind often think they possess “facial vision” based on skin sensa- 
tions, experiments have shown that such clues are neither necessary nor sufficient 
for the blind’s perception. 


*24. Restate each of Problems 8 through 23 in terms of necessary conditions. 
*%25. Restate 8 through 23 in terms of sufficient conditions. 

* 26. Explain the following: “The argument that no clear and present danger 
to American democracy now exists inside the country should not be taken to 
mean that no group would constitute a danger if it were powerful. For that would 
be a confusion of necessary and sufficient conditions.” (R. G. Ross, “Democracy, 
Party, and Politics,” Ethics, January 1954) 

* 27. One of the symptoms of tuberculosis is persistent coughing. Is coughing 
a necessary or a sufficient condition for tuberculosis? What can you say about 
medical symptoms generally? 

*% 28. Read “What does ‘if’ mean?” in the Mathematics Teacher for J anuary 1955. 
*29. Does p — 4 mean that gq follows p in time? 

*&30. Speaking of gamblers, Cardano (1501-1576) wrote, “If a man is vic- 
torious, he wastes the money won in gambling, whereas if he suffers defeat, 
then either he is reduced to poverty, when he is honest and without resources, 
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or else to robbery, if he is powerful and dishonest, or again to the gallows, if 
he is poor and dishonest.” Formalize and draw a conclusion. (Oystein Ore, 
Cardano, The Gambling Scholar, p. 187) 


ANSWERS TO EXERCISES 


(a) A statement. (Ὁ) Nonsense. (c) Same. (d) By (8), and see Sec- 
tion 1-7. (e) T. (f) T. ΩΤ. (h) F. (i) True, as is evident by 
arguing from the hypothesis to the conclusion; here p and 4 816 true. (j) True, 
as above, since milk is mostly water; here p is false, and gis true. (k) True, 
since milk is nourishing; here p and q are false. (1) False. In each of the 
first three cases the conclusion follows from the hypothesis by everyday rea- 
soning. It does not in the last, since many liquids containing mostly water are 
not nourishing. Here p is true and q is false. (m) (@:2, 6:2), (a:2, 6:—2), 
(a:2, 6:3). 

(n) If a = ὁ, then ac = be. a = ὃ implies ac = be. a = ὃ only if ac = be. 
From a = ὃ, it follows that ac = be. Hyp: a = ὃ, Con: ac = be. a = bisa 
sufficient condition that ac = bc. ac = be is a necessary condition that a = 6. 
(0) The ocean is mostly water — it contains.... The ocean is mostly water 


implies it contains .... The ocean is mostly water only if it contains... 
From the ocean is mostly water it follows that it contains.... Hyp: The ocean 
is mostly water. Con: It contains.... The ocean is mostly water is a suff- 


cient condition that it contains.... (Note: The best procedure in performing 
translations of this kind is to first write the expression in the form p — 4, being 
careful that p and q are sentences. Then write in other forms and make whatever 
adjustments are required to conform to good English usage.) (p) A sufficient 
condition that two triangles be similar is that they be congruent. A necessary 
condition that two triangles be congruent is that they be similar. (α) T, F, 
T, F. (δ “2” is not asentence. (t) p. (ἃ) g (v) ~g. (Ww) ~p. 


ANSWERS TO PROBLEMS 


1. Cite an equilateral triangle, one that is isosceles but not equilateral, and 
a scalene triangle. 3. Cite congruent angles, placed with their sides |, con- 
gruent angles placed otherwise, and noncongruent angles with sides not ||. 


A 
B 6 Β C 


FIGURE 2-2 
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5. When the converse is also true we have only the possibilities of both hy- 
pothesis and conclusion true or both false by (18). Figure 2-2 shows two tri- 
angles ABC. In the first, the hypothesis and conclusion are both true; in the 
second they are both false. Hence, unless we wish to deny that the Pythagorean 
theorem is a law, we must agree that p — q is true when both p and q are false 
as well as when both p and q are true. This is an example of the convenience of 
the property of — embodied in the last column of (11). Of course, we do not 
care to apply a theorem when its hypothesis is false, but the meaning assigned 
to — and its various verbalizations makes the stating of laws much simpler. 

9. There is fire — There is smoke. Fire is a sufficient condition for smoke. 
(Note here a violation of the agreement that only sentences be substituted for 
variables in p — g. However, this is really an abbreviation for “A sufficient 
condition that there is smoke is that there is fire.”) 11. (c¢ = 3) > (x? = 9). 
13. A country importing capital goods has a high investment rate — It has a 
large export industry. 27. Some are necessary but not sufficient (always present 
with the disease but also present at other times), some are sufficient but not 
necessary (their presence always indicates the disease but they may not: always 
accompany it), some are both, and others are neither. 


*2-4 Truth tables. A truth table of a logical formula is a table that 
shows the truth value of the formula that corresponds to each combina- 
tion of truth values of its variables. Truth tables are not essential to the 
development of logic, but they are a convenient tool for investigating logi- 
cal formulas and provide a procedure for testing any logical formula to 
see whether it is a law. If we set up truth tables as our criteria for es- 
tablishing laws in logic, we could use them as a device for proof. We 
preter to work with axioms instead and to use tables as an informal device 
outside the theory. 

For example, is p — (p A 4) a law? To answer the question we make 
table (1): 


(1) 


p—(p A 4) 


The third row comes from (2-2-6). The fourth row is obtained from 
the first and third by reference to (2-3-11). We see that the formula is 
not a law, since it is false when p is true and q false. 

To be able to compare the truth tables of different formulas it is es- 
sential to adopt a standard form of construction. To this end we always 
write the first rows in the following way. If one variable is present: 


82 ELEMENTARY LOGIC ἴσηαρ. 2 


(2) 


If two variables are present: 


(3) 


(4) 


Note that each of these tables is formed from the previous one by re- 
writing each column of the previous one twice, the first time with a 1 
below it and the second time with a 0 below it. For example, the first 
column of (3) reads 1, 1; and the first two columns of (4) read 1, 1, 1 and 
1,1,0. The second column of (3) reads 1,0; and the third and fourth 
columns of (4) read 1, 0, 1 and 1, 0, 0. 


(a) Using the indicated procedure, write the first four rows in a truth table 
involving four variables. ‘ 


In Section 2-5 we assert a number of identities, each of which claims 
that two logical formulas are synonymous. If two logical formulas are 
synonymous, certainly both should be true or both false in any given in- 
stance. In other words, their truth tables should have the same entries 
in the last row. In symbols, (p = 4) — (p «Ὁ 4). However, the con- 
verse is not true, since p <> q whenever p and qg have the same truth 
value, and this may happen without p and gq being equal. 


(b) Give an example of the observation in the preceding sentence. 


From the above discussion, we see that we can test an alleged logical 
identity by making truth tables for its members. If their last rows are 
different, we know the equation is not an identity. If the last rows are 
the same, we have verification but not proof of the identity. 


Verify the following by truth tables: (c) (2-5-2), (d) (2-5~4), 
(6) (2-5-7), (ὃ (2-5-8), (g) (2-5-23), (ἃ) (2-5-28). 


The truth table of a law of logic ought to contain only I’s in its last row. 
Consider, for example, the formula p V ~p. Its table is (5). 
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(5) 


Evidently this formula is a law. If we read it verbally, we have “p or 
not-p,” “p is true or not-p is true,” “p is true or false,” or “any proposition 
is either true or false.” We prove this in Section 2-7. 


Make truth tables for the following laws: (i) (2-7-5), 4) (2-7-9), 
(k) (2-7-23), (i) (2-7-12). ; 

In the following, change = to <> and make a truth table of the result: 
(m) (2-5-6), (0) (2-5-15), (0) (2-5-24), 

Show that the following are not laws and indicate the truth values for which 
they fail: (Ὁ) (P>g) > (>p), (4) P>9D > (p> ~9, 
(r) (p> 9g) A ~p) > ~4. 


A logical formula that is a law, or a sentence in the form of such a for- 
mula, is called a tautology. The negation of a tautology is called a con- 
tradiction. | 


(s) Show that p A ~p is a contradiction. (() Show that p—-~p is not 
a tautology. Show that it is not a contradiction! (u) Why is “I went or I 
did not go” a tautology? (v) Why is “I went and I did not go” a contradic- 
tion? (w) Show that the following is a tautology: It is snowing in Denver and 
it is not raining in Nashville, or it is not snowing in Denver and it is hailing 
in Kansas, or it is not hailing in Kansas, or it is raining in Nashville. 


Whenever a statement is in the form of a tautology we know that it 
must be true without further consideration of the truth of its parts. 
Surprisingly often people try to prove a tautology by arguments about its 
terms, without realizing that this is not necessary. Consider, for example, 
the following quotation from a newspaper editorial. ‘There may be 
justification for a subsidy—defense needs, for example—but it should be 
clearly understood that it is never economically justifiable. So the reasons 
for a subsidy must be strong enough to override the drawbacks.” This 
sounds as though the second sentence follows from the first. But the second 
sentence is a disguised tautology. It says that if a subsidy is a good thing 
its advantages must outweigh its disadvantages, 1.e., subsidy is justified > 
reasons for it must override its drawbacks. But what do we mean by say- 
ing that anything is justified except that the reasons for it override its 
drawbacks? In other words, by definition, x is justified = reasons for x 
override reasons against x. Hence the second sentence is logically equiva- 
lent to p — p, with p = the subsidy is justified, and is therefore true 
quite independently of the first sentence. 
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PROBLEMS 


1. Make truth tables for other laws in Sections 2-5 and 2-7. 
Which of 2 through 6 are laws? 

2.~(p Vg) (~p V “φ. 

BDA @d@VnlelpA a Ν ἡ. 
4,.(p>Qg po pA Ql. 

5. (Ὁ -- 4) - [8 > (p > Q)I- 

6  -Ὁ φὶ' -ο ἷῷ Λ 7) - (ᾳφ Λ ri. 

Ἵ 
8 


, Make a truth table for (2-7-39). Show that its converse is not a law. 
. Experiment with other logical formulas. 
9. Why is this a disguised contradiction? “Our speaker was born in Man- 
hattan, the first of his family to venture north of the Mason-Dixon line.” 
*%10. Is the following verse by John Donne a tautology? 


I am unable, yonder beggar cries, 
To stand, or move, if he say true, he lies. 


11. Is an argument based on a tautology necessarily a poor argument? 

12. Ogden Nash once wrote, “I regret that before people can be reformed 
they have to be sinners.” Why is the clause beginning “before...” true? 

13. Why is “Either an electron is excited to the fullest eaent or it is not 
excited at all” not a tautology? Why would it be a tautology if “bo the fullest 
extent” were deleted? 

14. How many columns are there in a truth table for a formula involving n 
variables? 


ANSWERS TO EXERCISES 


(Ὁ) (p:2 = 2, 9:3? = 


(e) 


Note how we include an additional row for each formula that is a part of the 
final formula, then calculate each row in order. 
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(k) 


(p) Fails when q is true and p false. (q) Same. (Ὁ) Same. (w) If p = it 
is snowing in Denver, g = it is raining in Nashville, r = it is hailing in Kansas, 
then the sentence is (p A ~g) V (~p A r) V ~r V gq, which is easily shown 
to be a tautology by a truth table. 


ANSWERS TO PROBLEMS 


4, 5, and 6 are laws. 7. The converse fails for g true, p and r false. 


2-5 Logical identities. In this section we derive a number of useful 
identities from a few very plausible axioms. We let A, ~, and = be un- 
defined and define V, V, —, and + as in Sections 2-2 and 2-3. We specify 
that the following are sentences: a = b, ~p, pA 4, pV ᾳ, pV ᾳ, 
p — 4, p «Ὁ q, and expressions obtained by substituting for their variables 
other variables or sentences. 


(1) Ax. = 2 (Law of identity), 
(2) Ax. pN\q=aAp, 

(3) Ax. PN@AN=PAQAT, 

(4) Ax. PA@VnN=PAQDV DPAr), 
(5) Ax. pNp=pP, 

(6) Ax. ~~p = p. 


(a) Make the substitution (p:You can walk, g:You can swim, 7:You can fly) 
in (1) through (6). (Ὁ) What is the scope of each ~ in (6)? (ὁ) Suggest 
names for (2), (3), and (4), 


In building any mathematical theory, we can list laws in many different 
orders, provided, of course, that each proof uses only previously estab- 
lished laws. The order here has been chosen to make the proofs as short 
as possible and to bring out the similarity of the properties of V and Λ. 


(7) ~pAQ=n~pV ~¥, 


(De Morgan’s laws) 
(8) ~(p VQ =~p A 4. 
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To prove Eq. (7) informally we note that (2-2-10)(p:~p, g:~@q) is 

Hence ~p V ~¢ = ~(p A 4). In more formal style, 

(9) ~pAQgQ=~(~~pA~~g  ῳ(β):4) 

(10) =~pVv ~@ (2-2-10) (p:~p, q:~q). 
Similarly, (8) is proved by 

(11) “ὧν 4 -Ἦ οἰ» A ~g] (22-10) 


(2) 


~p A “ἃ (6)(p:~p A ~9). 
(d) Rewrite this proof informally. 


Laws (7) and (8), named after the English mathematician Augustus 
De Morgan (1806-1871), are useful in proving other laws and in stating 
the negations of compound sentences. The first law states that to deny a 
conjunction is to affirm the disjunction of the negations of its terms. 
The second states that to deny a disjunction is to assert the conjunction 
of the negations of its terms. To apply them, one must first state a com- 
pound sentence in the form p A qorp V gq. For example, John and Jim 
are here = (John is here) A (Jim is here). Hence the negation is (John 
is not here) V (Jim is not here), i.e., (John or Jim is not here). 


Simplify: (6) ~(We won and we are happy). (ὃ ~(We won or we are 
happy). (6) ~(John and Jim are 17). 

State the negation of the following in two ways, first using logical symbols 
and then using familiar English: (h) John should honor his father and mother. 
(i) He is neither rich nor poor. 


(13) pVq=aV BP, 

(14) ρν (φν ) τ- (ν ὡὴ Ν Υ, 

(15) »ν(ᾳληξίν 4) Δ ᾧ ν ἢ), 
(16) pV p= PD. 


Remember Section 1-17! (7) Interchange V and A in (18) through (16) 
and state your conclusion in words. (k) What must be the key to proving 
(13) through (16)? (1) Apply (V:+, A::,~:—) to (1) through (6) and 
(13) through (16) and state your conclusion in words. 
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To prove (13), we write 


(17) pVq=n~(~pA~q (2-2-10) 
(18) = ~(~¢ A ~p) (2) (p:~p, g:~q) 
(19) =qVp (2-2-10) (p:q, g:p). 


(m) Give reasons in the following proof of (16). 


(20) pV p= -~(~p A ~p) 
(21) = ~(~p) 
(22) = p. 


The others [(14) and (15)] are proved similarly. 

From the above identities it is easy to derive many others. We list below 
those that are most frequently used or are required for later proofs in this 
book. 


(23) p-q=n-~pv ᾳ, 
(24) ~p-g=pA~~q| 
(25) “ῷῷ «5» 8--(»Λ ὧν GA “φ. 


Because of (3) and (14), parentheses are usually omitted when only 
V or A are involved. 


(26) ~pAQArn=a~pV “μῶν ον, 
(27) ~pVaVrn=n—pA “GA ~r. 


State the negation of: (0) If we lose, I’ll eat my hat. (0) All three of 
our teams won today. (p) At least one of our three teams won today. (4) He 
will graduate if and only if he passes this test. (r) “There’s neither honesty, 
manhood, nor good fellowship in thee.” (King Henry IV) 


(28) p-q=n~qd-~p. 


Since ~g — ~p is called the contrapositive of p — q, (28) asserts that 
an implication and its contrapositive are synonymous. To prove it, we 
note by (23) that (p > 4) = (~p V 4) = (VV ~p) = (αν ~p) = 
(τς — ~p), the last step being justified by (23)(p:~q, q:~p). 


(s) Write out the preceding proof with reasons. (Ὁ) Illustrate (23) for 
(p:He wins, 4:1 lose). | 
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To prove an implication, we very often begin by assuming the negation 
of the conclusion and arguing to the negation of the hypothesis. This 
procedure is completely justified by (28). For example, in economics 
there is a theorem to the effect that when resources are allocated so as to 
yield maximum output, the marginal product (the additional output that 
would result from an additional unit of input) is the same for all enter- 
prises. In symbols, Output is maximum — Marginal products are equal. 
The economist argues for this as follows. Suppose that marginal products 
were not equal. Then an additional unit of input in one enterprise would 
result in greater output than would be lost by removing this unit from 
another location. Since output can be increased by shifting resources, it 
is not maximum. What the economist has proved by this argument 15 that 
~ (Marginal products are equal) — ~ (Output is maximum). 


(u) If no tracks are observed in snow we usually conclude that no one has 
passed by. Why is this legitimate? 


The following are listed for future reference: 


(29) ρΡ.»4 - 4 “»Ρ, 
(30) [Ρ -- (ᾳ --Ὁ 5)] ξξ [ῷ Λ 4) -- ἡ], 
(31) ΙΡ --" (ᾳ --Ὁ 7)] Ξξ [ᾳ -- (ῷ --Ὁ rl, 
(32) ΙΡ -- (ᾳ -Ὁ 7)] = [r-@- “α)], 
(33) [ῳ NM - »] ΞΞξ [ᾳ -" ῷ V ~p)I, 
(34) Ι -- (ᾳ -Ὁ|» Δα 4]}} ξ [πῸὸΖο Δ ὧν DA OI, 
(35) [ῳ AQ -ῦ Ξ (pA ~r) > “αὶ, 
(36) lpg) A wor) ΞΞ lo (ᾳ Ν Ὁ). 
Rewrite in as simple form as possible: (v) Ὁ» -ὸᾳ, (w) (ᾧ» -οὺν 
(~p — ~9), (x) ~(p A ““4), (y) ~Ip A (@ V η)], (z) ~(~p V 


~(~p V ~(~a V ~5))). 


PROBLEMS 


Derive the identities in Problems 1 through 6. 


1. ““ἬΒΈ ΞΡ ΟΡ. 2. ~p \ ~p = ~D. 
38. ~(~p9 AQ =pV ~¢. 4,.~(pV “ὦ = ~PDAG. 
5. (28). 6. [p > ~q] = [ᾳ > ~p). 


_ In Problems 7 through 17 write the negation in good English after first writing 
it in logical symbols. 
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7. Lam rich and happy. 
8. John or Jim is mistaken. 
9. Mathematics and physics are sciences. 
10. Neither of you is right. 
11, If it rains today, I’ll stay home. 
12. One and only one of you may go. 
13. 7 and 14 are primes. 
14, John, Jack, and Jim are 17. 
15. New York is a big city if and only if Chicago is a big city. 
16. If he wins, I can’t lose. 
17. Both countries are at fault. 


18. Assuming that if one cannot do mathematics he cannot be an engineer, 
show that if one is able to be an engineer he must be able to do mathematics. 

19. Argue that if an enterprise is engaging in various activities in such a way 
as to maximize its profits, the additional revenue from increasing expenditure 
by one unit is the same in all activities. 

20. Provea Κέ 0 > —a = 0. 
*21. Investigate the properties of V , looking for laws involving it. Is it associa- 
tive? Distributive over V ? What is the negation of p V αἵ Show that p V q = 
pr ~?d. 
* 22. Start with — and ~ as undefined and define the other logical symbols in 
terms of them. 
* 23. Suppose we take p/q as our only basic term, where we take it to mean that 
it is false that both p and gq are true, that is, p/¢g = ~(p A q). Make a table 
for p/g as we did for other formulas in Section 2-2. Then define all other 
formulas in terms of “/”. (Note: This is called the stroke and is of interest be- 
cause it shows that all logical formulas can be defined in terms of just one.) 
Work out some laws involving the stroke. 
*24. What does “p unless g” mean? Is it synonymous with ~q — p or with 
~ge> p? 
*& 25. Every logical identity involving only V, A, and ~ remains an identity 
if V and A are interchanged. Verify this in particular cases. Prove that it is 
true for all identities derivable from our axioms. 


ANSWERS TO EXERCISES 


(a) There are various possibilities; for example, one rendering of (2) is: 
To say that you can walk and you can swim is the same as to say that you 
can swim and you can walk. (b) ~pand p. (c) Commutative, associative, 
and distributive laws. (6) We did not win and/or we are not happy. (ἢ We 
did not win and we are not happy. (g) John and/or Jim is not 17. (8) John 
does not have to honor both his parents, or (John need not honor his father) V 
(John need not honor his mother). (i) He is either rich or poor. (j) The 
results are all laws of the theory. (k) The relation between Λ and V em- 
bodied in (2-2-10), (7), and (8). (1) All but (5), (15), and (16) are laws of 
elementary algebra. Evidently the algebra of logic has some similarities to the 
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algebra of numbers, but it ts different. (m) (2-2-10) (q:p), (5)(pi~p), (6). 
(n) We’ll lose and I won’t eat my hat. (0) At least one of them lost.  (p) 
None of them won. (4) He will graduate and not pass, or he will not grad- 
uate and will pass. (r) Thou has some honor, manhood, or good fellowship in 
thee, and possibly more than one of these virtues. (8) 


(37) PIES Ap V 4 (23) 

(38) =qV-~p (13)(p:~p) 

(39) =~wn~g V ~?P (6) (p:@) 

(40) =~qg—7>~p ΕξΞ τς (28) (p:~4a, 4:9}). 


(u) (28). WW) pVq (whpVopVaVrg pra (YY) ~DV 
(~g A wr). (2) pA (ὧν (aA δ)). 


ANSWERS TO PROBLEMS 


1. (6)\(p:i~p). 3. (2-2-10)(q:~q). 5. Apply (2-5-7) to (2-38-12). 7. I am 
either not rich or not happy. 9. Not both mathematics and physics are sciences. 
Either one or the other or both are not sciences. 11. It rains today and I don’t 
stay home. 13. Either 7 or 14 is not prime. 1. New York and Chicago are 
big cities or they are both not big cities. 


2-6 Rules of proof. Everyone is familiar with the fact that what seems 
reasonable, even obvious, to one person may seem quite unreasonable 
and obscure to another. An argument that convinces some may not appear 
at all convincing to others. Nevertheless, in science, and particularly in 
mathematics, we wish to prove laws in such a way as to obtain universal 
agreement. How can this be done in view of the diversity of opinions and 
experiences of men? 

To see the answer to this question, let us imagine that we wish to con- 
vince someone of the correctness of a certain statement. We might try 
to convince him by citing some authority in which he believes (some- 
times called “proof by intimidation”), by getting him to “see ” (“proof by 
intuition”), by appealing to his emotions (“proof by waving the red flag”), 
by confusing and tricking him with words and faulty logic (“elastic in- 
ference”), or by other methods familiar to everyone. These methods have 
two serious disadvantages: they can be used to establish false statements 
just as easily as to establish true ones, and they do not lead to universal 
agreement. 

If we want a more satisfying procedure we must begin by assuming some 
axioms, as indicated in Section 1-16. But what if our listener does not 
agree that our axioms are truly laws? Is there no way out if he will not 
accept the axioms we propose? There 7s a way out; namely, we may ask 
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our listener to agree merely that 1 our axioms are indeed laws, then our 
theorems are also laws. If we take this tack, we eliminate argument about 
the truth of both the axioms and the theorems. We ask our listeners 
merely to agree that the theorems really do follow from the axioms! This 
is precisely the procedure we follow in mathematics. The mathematician 
claims only that the theorems follow from the axioms, and he has no ob- 
jection if someone prefers to adopt different axioms. 

But suppose that our listener objects to the manner in which we derive 
the theorems from the axioms? Then we shall have to come to some agree- 
ment with him as to what procedures are legitimate. And if we cannot 
agree with him on method of proof? Then we take the same way out as 
before. We say to him, “Let us merely agree that if these axioms are 
accepted and if these methods of proof are used, then these theorems can 
be obtained.” In constructing a formal mathematical theory we state in 
advance the acceptable methods of deriving theorems from axioms. Then 
all we claim is that the theory is derived from the axioms according to 
these rules. If the axioms and the rules are acceptable to anyone, the 
theorems should be also. If the axioms are applicable to any particular 
situation (i.e., are true in a particular case), the theorems may be applied 
to that situation. 

Our first two rules of proof are the Rule of Replacement and the Rule 
of Substitution introduced in Section 1-11 and used frequently since. For 
convenience of reference we recapitulate them here. 


RULE OF REPLACEMENT: If a term in an expression 18 re- 
(1) placed by a synonym, the resulting expression 18 synonymous 
with the original. If the original is a law, so 18 the result. 


To use the first part of this rule we need to cite the law that asserts the 
synonymity of the term and its replacement. Thus we write 


(2) a+b= —(—a)+b) Rep, (1-9-33). 


The reason indicates that a replacement, justified by (1-9-33), has been 
made in the left member to get the right member. To use the second part 
of the rule we must have two previously established laws, the law in which 
we intend to make the replacement and the law asserting that the replace- 
ment 15 a synonym of the term it replaces. We use the abbreviation “Rep” 
followed by a reference to the law in which the replacement is made and 
to the law asserting the synonymity. For example, 


(3) a+b=b+(a+0) Rep, (1-9-7), (1-9-2), 
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The reason on the right indicates that (3) was obtained by replacement in 
(1-9-7) of a by a + 0, which is a synonym according to (1-9-22). 


Rue or Susstitution: If a sentence is obtained by making 
(4) significant substitutions for the variables in a law, the result 8 
a law. 


When applying this rule we use the abbreviation “Sub” followed by refer- 
ence to the law in which the substitution is made and an indication of the 
substitution. For example, 


(5) (b+ cla = α(ὖ -Ἡ ο) Sub, (1-9-14)(a:b + c, b:a). 


Recall that substitutions are permitted only for variables and that they 
must be made throughout, whereas replacements may be made for any term 
and need not be made throughout. 


(a) Review Section 1-11. (Ὁ) Review previous uses of these rules. 


Our next rule is based on the meaning of implication indicated in Sec- 
tion 2-3. As indicated in (2-3-11), when p — 4 and p are true, q must be 
true. Schematically, 

pq 
(6) ; 


.- - -- 


οὖν 
We embody this idea in the following rule of proof. 


(7) RuLE OF INFERENCE: If an implication is a law and its hy- 
pothesis is a law, then its conclusion is a law. 

It is this rule that enables us to detach a conclusion and state it separately 

when we know that a hypothesis from which it follows is true. When using 

it we write “Inf” followed by reference to the implication and the hypothe- | 

sis. For example, from 


(8) 2= «Δ —-3-2= 374 ϑυῦ, (2-3-13)(a:2, b:\/4, c:3), 
(9) 2= νά (assumed here), 
(10) 3:2 -- 3/4 Inf, (8), (9). 


(c) Would we get a satisfying rule of proof by interchanging “hypothesis” 
and “conclusion” in (7)? Explain. | 
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If every law were stated as a theorem, axiom, or definition, each proof 
would consist of a single statement justified by one of the above rules. How- 
ever, many laws are not worth displaying as theorems even though they 
are needed to prove other more interesting laws. Accordingly, a proof 
usually consists of several steps, each one justified by a rule of proof. 

A complete formal proof is a sequence of steps such that: 


I. Each step ts a sentence. 


II. Hach step is an established law (axiom, definition, or 
(11) previously proved theorem) or is the result of applying a rule 
of proof to a-previous step or to an established law. 


ΠΙ. The last step is the theorem to be proved. 


A formal proof is a complete formal proof or an abbreviation of one ob- 
tained by omitting or consolidating steps and reasons. An informal proof 
is an expression of the steps of a formal proof in paragraph style by using 
everyday language. In this book we use the word “proof” to apply only 
to formal or informal proofs. “Show” or “argue for” are used for other 
discussions tending to convince. We call such arguments heuristic dis- 
cussions or plausibility arguments. (See Section 2-12.) 


(d) Cite examples of different kinds of proofs given in this book so far. 


If we relied only on the three rules of proof given above, proofs would be 
unreasonably long and cumbersome. It is therefore customary to cut 
proofs short by omitting steps, abbreviating reasons, and adopting fur- 
ther rules of proof. Such measures are considered perfectly legitimate 
provided they really do amount merely to ways of abbreviating proofs 
that could be carried out by the use of Rep, Sub, and Inf. 

One very useful rule of proof, known to the reader from high-school 
plane geometry, applies to proving laws of the form p—q. This rule 
permits inserting p as a step “by hypothesis.” Then if q can be derived 
from this assumption, we consider that p — qis proved. For example, 
consider the theorem 


(12) If three sides of one triangle are equal in length respectively 
to three sides of another, the triangles are congruent. 


When we use letters for variables, this becomes 
(12) [AB = XY, BC = YZ, and CA = ZX] > [AABC 2: AX YZ). 


Here AB means the length of the segment AB. In proving the theorem, 
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we assume the equations in the hypothesis and then reason step by step 
to the conclusion. When the conclusion is reached, the proof of (12) is 
considered complete, and we write “Q.E.D.,” an abbreviation for a Latin 
phrase meaning “that which was to have been demonstrated.” 

In using this method of proof, we treat the variables in the hypothesis 
as though they were constants. For example, the above theorem is sup- 
posed to hold for all values of A, B, C, X, Y, Z (that is, for all points), 
but when we assume AB = XY by hypothesis in the proof, we do not 
assume that AB = XY for all A, B, X, Y. To assume this would be to 
assume that all segments are equal! What we do, rather, is the following. 
We imagine A, B,C, X, Y, Z to be definite but unspecified points for which 
the hypothesis holds; i.e., we treat the symbols as constants. Then we 
show from this assumption that the conclusion holds for these points. 
However, since we have made no special assumption about these points, 
the same reasoning would enable us to reason from hypothesis to conclu- 
sion in every case. Hence the theorem is proved, since we are assured that 
its conclusion is true whenever its hypothesis is true, which is all that is 
required by (2-3-11). 


Rue or Hyporuesis: Jn order to prove an implication, 
its hypothesis may be introduced as a step and treated as a law, 
(13) except that the Rule of Substitution may not be applied to its 
variables. If the conclusion appears as a step, the implication 
is proved. 


When using this rule, we write “Hyp” after an assumed hypothesis. 
When the conclusion is obtained, it is justified by whatever rule is appro- 
priate. Then the implication is justified by writing Q.E.D. and referring 
to the steps asserting the hypothesis and conclusion. Or we may omit 
the statement of the implication and just write Q.E.D. when the conclu- 
sion is justified. Note that any sentence may be assumed by Hyp as a 
step. Note, also, that a step assumed by Hyp and steps derived from it 
are not necessarily laws. For this reason, substitution is not permitted 
in such steps; however, Rep and Inf can be applied to them. 

To illustrate the use of Hyp, we prove several laws of elementary al- 
gebra in continuation of Section 1-16. 


(14) (a = ὃ) > (ὃ = a). 
(15) ~~ Proof of (14): 
(a) a=a | Sub, (2-5-1) (a:a), 


(b) a= ὃ Hyp, 
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(c) b=a Rep, (a), (b), 
(d)(@=b)>(b=a) QED,, (b), (ὦ). 
Note that (6) was obtained by replacing the first a in (a) by ὃ, which is a 


synonym according to the assumption (b). We could have omitted step (d) 
and simply written Q.E.D. after step (c). 


(e) Similarly prove one of the following. 


(16) (a= b)—-(a+c=b+4 ὦ, 


(17) (a = ὃ) > (ac = be), 

(18) (a = b) > (—a = —4), 

(19) (a = b) > (α -- ς = ὃ — ὃ), 

(20) (a = ὃ) > (Λα = 1/b), 

(21) (a = δ) > (a/c = b/e), 

(22) (a = b) > (α" = δ), 

(23) G@Qte=b+0 >(=)d), 

(24) (c = 0 A ac = be) > (a = δ), 

(25) (a=bAc=d)—>(ac= bd Na+e=—b4+ a). 


(f) Does (20) hold for all numbers a and 6? (g) Does (21) hold for 
any number c? (h) In (15), which steps are laws? Gi) Prove that 
2=4— 10 = 20. (j) Could we use (i) and Inf to prove that 10 = 20? 


An argument (or proof) is called valid when it proceeds by applying 
laws of logic and rules of proof, or could be justified by reference to such 
Jaws and rules. The reader should note that any law of logic may be the 
basis of a valid argument. He should also notice that the validity of an 
argument does not depend on the truth or falsity of its premises or con- 
clusions. One cannot argue validly from true premises to false conclusions, 
but one can argue validly from false premises to either true or false con- 
clusions. On the other hand, because a man’s conclusions are correct it 
does not follow that his argument is valid. 


(k) Give examples to illustrate the comments in the preceding paragraph. 
Show that the following two arguments are valid: (1) If the price of butter 
increases, demand decreases. The price has increased. Hence the demand 
has decreased. (m) Free competition leads to price cutting and maximum 
output. Price cutting is rare in our economy and output is usually below 
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capacity. Hence free competition is not universal. (n) Kant held that all 
proofs of the existence of God were fallacious. Does this show that he was an 
atheist? 

An argument that is not valid is called invalid. A line of reasoning that 
is contrary to logical laws is called a fallacy. Many fallacies are based on 
misapplication of laws of logic or application of a logical formula that is 
not actually a law. Fallacies are often difficult to detect, but it is usually 
helpful to restate (or attempt to restate) a suspect argument in symbolic 
form. It should be kept in mind that the word “fallacy” refers to reason- 
ing and not to the premises or conclusion taken alone. 


(o) Collect and/or construct examples of invalid arguments and explain the 
fallacies involved. 


When we construct a theory on the basis of explicitly stated axioms and 
rules of proof, we have not really eliminated all possibility of controversy. 
For one thing, there may be differences of opinion as to whether we have 
correctly applied the rules of proof. But this is just a question of whether 
we have made a mistake. Mistakes may be hard to find, but such dif- 
ferences of opinion can be solved by sufficiently careful examination of 
the theory. Unsolvable disagreement is still possible, however, on whether 
the axioms and rules of proof should have been adopted at all, whether 
the theory that results from them is a good theory, and so on. Such 
questions are not answered by the theory itself, but they are placed out- 
side the theory when we agree to argue on the basis of the axioms and 
methods of proof. Hence we may expect universal agreement within the 
theory, but no universal agreement about it. To settle arguments about 
a theory we should have to construct a second theory about it. A theory 
about a theory is called a metatheory. Of course, there would remain 
areas of possible disagreement about the assumptions of any metatheory. 
Evidently we cannot eliminate disagreement or controversy, but we can 
construct a theory in such a way that disagreement is possible only about 
certain parts of it, namely the axioms and methods of proof. This is a 
great advantage because it leads to universal agreement over a consider- 
able area, avoids arguing about matters that can be agreed upon, and 
identifies the really controversial issues. 


PROBLEMS 


1. Suppose we have proved the Pythagorean theorem: Z ABC is a right 
angle > AB? + BC? = AC?. Now suppose that we have a particular triangle 
XYZ in which Z XYZ is a right angle, XY = 7 and YZ = 5. Give an informal 
proof that XZ2 = 52-4 72, Indicate the rules of proof used. 

2. Prove one of (16) through (22) with a complete formal proof. Rewrite it 
in informal style. 
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3. Prove others in the list (16) through (22). 

4. Prove that if the ocean is lemonade it contains citric acid, indicating your 
rules of proof. 

5. Accepting as a fact that the ocean contains salt, prove it contains sodium, 
indicating rules of proof. 

6. What rules of proof were used in Section 2-5? Insert reasons in the 
following complete proof of (2-5-7). 


(a) ~(p Ag) = ~DA 9, 

(b) ~~p = p, 

(c) itis! ΞΞ | f° 
(d)~pPAQ=~(~~pA~-~9, 
(0) wp Vag = ~(~~pA~~9, 
(ἢ ~pAQ=~pV ~¢@. 


7. Similarly prove (2-5-8). 
8. Give a complete proof of (2-5—28). 
9. Explain why the following schema is valid, and state a possible rule of 
proof based on it. 
Pq 
(26) “4 (valid) 


10. Explain why the following schemata are invalid. 


pd jh aa 
(27) YP. (fallacious!) . 4 . (fallacious!) 
" ~g s 3p 


11. From 1.5 = 3/2 and the law in (22), by what pattern can you conclude ~ 
that (1.5)? = (3/2)?? 

12. If two lines are parallel, they do not meet. If two particular lines do 
meet, what conclusion can you draw and why? 

13. Suppose you know that the squares of two numbers are equal. Can you 
draw from (22) the conclusion that the numbers are equal? Explain. 

14. Suppose two numbers are not equal. Can you draw the conclusion that 
their squares are not equal by relying on (22)? Explain. 

15. Prove (23) by using (13)(a:a + ο, δ: + ες, c:—c). 

16. Similarly prove (24). 

17. Show that (ας = bc) > (a = δ) is not a law. Detect the error in the fol- 
lowing fallacious proof. By (17)(a:ac, b:be, c:1/c) we have ac = be > ac({1/c) = 
be(1/c) or ac = ὃς — a = ὃ. 

18. Show that the converse of (22) is not a law. 

*19. Prove 


(28) (p= -(~p =~Q), 


(29) (pP=g—-(pAr=aqAr). 
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20. What conclusions can you draw from a? # 6?? 

21. If we know that an argument is valid and that it leads to a false conclu- 
sion, what can we conclude? 

22. If we know that an argument leads from correct assumptions to false 
conclusions, what can we conclude? 


ANSWERS TO EXERCISES 


(c) No; from p — q and q, p does not follow. (d) In Section 1-16 we gave 
informal and formal, but not complete formal proofs. (6) (16)a+c¢ = a+ ¢; 
a=b:ate=b+c;a=b-ate=b+e. (ὃ Not fora = ὃ = 0; 
but (20) is a law as it stands, since a law is a sentence all of whose values are 
true, and (20) is not a sentence for (α:Ὁ, 6:0). (g) Not fore = θ. (8) (a) 
and (4) only. (i) (17)(a:2, 6:4, ¢:5). () No, since we de not know that 
2 = 4. 


ANSWERS TO PROBLEMS 


1. We have given as true that Ζ XYZ is a right angle, XY = 7, YZ = 5. 
Now ZXYZ is a right angle > XY2+ YZ? = XZ? by Sub, (Pythagorean 
theorem)(A:X, B:Y,C:Z). Also, XYZ is a right triangle by Hyp. Hence 
XY2+ YZ2 = XZ? by Inf and the two previous steps. Hence 7? + 5? = XZ? 
by Rep. The equation to be proved then follows by Sub, (14)(a:7? + 5”, b:XZ?) 
and Inf. 2. (16) By the law of identity, α - ὁ =a-+e. Since a = ὃ by 
hypothesis, we may replace a by ὃ in the right member to geta+c = b+ ς, 
which is the desired conclusion. 4. Use Hyp and others. 5. Use Inf and others. 
9, (2-5-28) and Inf. 17. The proof is informal and incomplete, but it would 
still be in error if omitted steps were inserted. The point is that “1/c” is unde- 
fined for c = 0. Hence the substitution is not significant unless c ~ 0. Since 
ac = be > a = ὃ is still a sentence when c = 0, we must exclude this possi- 
bility by inserting c τέ 0 in the hypothesis, as we have done in (24). 


2-7 Laws of implication. We now use our additional rules of proof to 
prove theorems that enable us to reason from a sentence to another that 
is not synonymous with it. 


(1) pp (Law of tautology). 


This law is so obvious that it would seem out of the question to prove it. 
However, it can be proved by use of the Rule of Hypothesis. 


(2) Proof of (1): 
(a) p Hyp, 
(Ὁ) p— p Q.E.D., (a). 
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Note that (Ὁ) is justified, since we have assumed p= and so, of course, ob- 
tained p as a step, in conformity with (2-6-13)! 


(3) (p=Q—-(p— 9). 


This is proved by assuming p = g by hypothesis and then replacing 
p by q in the right member of (1). 


(a) Does it follow from (3) that 4 — 2? since 4 = 22? (b) Prove from (3) 


that ~~p-— p. (ὁ) Derive (4) and (5) from (1) by using (2~5-23) and 
(2-5-7). : 


(4) pV ~p (Law of excluded middle), 
(5) ~(p A ~p) (Law of contradiction), 
(6) (ᾧ ΔΛ 4) -»». 


To prove (6) it is convenient to first prove g > (p V ~p). Since this 
law is not important except for the purpose of proving (6), we do not list 
it as a theorem. Instead we call it a lemma. 


(7) Lemma: g— (p V ~p). 


Proof: 
(a) 4 Hyp, 
(b) p V ~p (4), 


()gq> (pV ~p)  Q.E_D., (a), (b). 
(8) Proof of (6): 
(a) [ῳ Λ 4) > Ὁ] =la—>(p V ~p)] (ὥ-ὅ-88), 


(b) (p Λ 4) > p Rep, (7), (a). 
(9) p— (pV q). 


*(d) Prove (9) by showing that it is synonymous with (6)(¢:~g). %¥*(e) 
Show by a truth table that (6) and (9) are not laws if we change — to «9. 
(f) Show that the converse of (9) is not a law. (6) Prove that (p A 4) > 4 
is a law by showing that it is synonymous with (6)(p:q, ¢:p). 


Although we cannot include the proofs of all theorems, we list those 
that are most interesting and not too complicated, and in an order such 
that each can be proved conveniently by relying on those already listed. 
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(10) p—{q> (pA Qi. 


Theorem (10) follows from (2-5-3834) and (4). It permits us to infer 
that p A q is true if we know that two propositions p and g are each true, 
for by (10), from p and Inf we can conclude that g — (p A 4). Then 
from g and Inf we can conclude (p A 4). 


(11) pogolpV 1) -" (ᾳ Ν 7)). 


Theorem (11) enables us to “add” the same proposition to both members 
of an implication. 


(12) [ρ -.. φ) A (qr>n) τὺ (ῷ - 7) (Law of syllogism). 


Theorem (12) is the principle of logic that permits us to construct 
chains of reasoning. Thus, if we know a --Ὁ b,b—c,c—d,...,%— y, 
and y — z, then we may conclude by using (12) repeatedly that a — Ὁ, 
a—-d,a—-xz,a—y, and finally that a — z. Arguments based on it 
are called “syllogistic.” 


* (13) Proof of (12): 


(a) (~g > ~p) > [(ωφ V 7) > (~P V 7)] Sub, 
(11) (p:~q, g:~Dp), 


(b) ῳ -" φ) olka Vr (ων )] — Rep, (a), (2-5-28), 


oo gy (Ὁ e717) Rep, (b), (2-5-23), 
(4) [ᾧ --- 4) A (ᾳ -- 7)] -- (Ὁ --Ξ 1) Rep, (0), (2-5-30). 
(14) pA Pgs 
(15) | I~g A (p> 4)] -Κ ~D, 


(h) How are laws (14) and (15) related to the discussion in Section 2-6? 
(16) DRAG FOL ΝΎ, 
(17) pVagAnrnrpvagaar. 


Laws (16) and (17) illustrate how implication and identity are quite 
different. With = in place of —, they would certainly not be laws. 


(18) po p, 
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(19) (p > 4) > (ᾳ © 5), 
(20) lpogd A (4 «» 5)] -᾿ (Ὁ - 1). 

(i) Which of (18) through (20) holds when © is changed to >? (ἢ) to =? 
(21) P= 4) -Ὁ (Ὁ «Ὁ φ).. 

(k) From (21) prove that (p -- 4) << (φ --» ~p) is a law. 


Note that (21) permits us to write a theorem on equivalence corre- 
sponding to each identity in logic. 


(22) (x= y) A Y= 2)] - ὦ = 2). 
* (22) Proof of (22): 


(agjtz=yAy=z Hyp, 


(b)a=y Inf, (6)(pia = y, gy = 2), 
(ec) y= 2 same, (2-5-2), 
(d)x=z Rep, (c), (b), Q.E.D. 


The following laws are stated here as examples from the infinite list of 
theorems provable from the axioms of logic. They were selected for their 
own interest or because we shall use them later. 


(23) p— (ᾳ --" p), 

(24) ap 5 (pg), 

(25) ( -Ὁ 4) -ο ἰῷ Ar -" (ᾳ Λ γ)], 

(26) ἰῷ -- 4) ΛΔ (" - 8})} -ο [ᾧ ΔΛ τ) - (ᾳ ΔΛ 9)], 
(27) ( - 4) -ο [ῷ ΔΛ γὴ) > 4], 

(28) P-gp (ᾳ ν 4). 


The first asserts that a true statement implies any statement. The last 
two say that a true implication remains true if we conjoin a statement to 
its hypothesis or disjoin one to its conclusion. 

(29) (pr) A (ᾳ -Ὁ )] -ο [pM ν φ) > 7, 
(30) q—\l(p A 4) = pl, 
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(31) ~¢g— lp V a) = pl, 

(32) ᾳ -" 1 Ν 4) « 4], 

(33) “το [ῷῷ ΔΛ 4) «» 4], 

(34) ( -- 4) «»Γ[( Λ 4) = pl, 

(35) ῳ -- 4) «»[ῷ Vg) 4), 

(36) ( «-» 4) -οἷί Λ τ) «Ὁ» (ᾳ Λ 1η)], 

(37) [ῳ «» 4) Ares] - [ᾧῷ Νν τ) «Ὁ» (ᾳ Ν 8)], 
(38) (pr) A Ὁ --Ὃ 8)} -ῦ [Ὁ -"Ὁ (ra 8s), 

(39) [[“Ὁ -Ὁ (ᾳ «» 7] Δ @onlrlreo@ Ν g. 


Any argument based on assuming a negation is called indzrect. The 
basis of such arguments is (2-5-28), but they take many forms, of which 
the following are typical. 


(40) (τα -Ξ 4) -Ὁ α, 

(41) {fo λίῳ A “ῷ > ab > α, 

(42) {pA lp A ~d 7 ~Pi > α, 

(43) [~q-> (p A ~p)] > «, 

(44) [(~g— p) A ~p] > 4 

(45) [(pA~-Qd-rAns op”, 
(46) [ῳ A ~q) > τρί = ῳ -» Φ, 

(47) [Ιῳ Δ “΄᾿Ὗἢ -Ὁ α] = (p>. 


An example of the use of (44) is the proof that two different lines parallel 
to the same line are parallel to each other. Let the theorem be written: 
(alle A bile A a # Ὁ) > (a|| δ). Suppose ~(a || b). Then a meets 6, 
say in P. Then (al|c) A (Ὁ || c) A (a and ὃ pass through P). But this 
is false since only one line can be drawn through a point and parallel to a 
given line. Letting g = a||b, p = (alle) A (δ llc) A (@ and ὃ pass 
through P), we have ~g — p and ~p. Hence by (44) and Inf we may 
conclude q, that is, a || b. Then the Rule of Hypothesis establishes the 
desired implication. 


(1) Use a similar argument to show that if two lines are perpendicular to the 
same line, then they are parallel. (m) Argue that if a man invests his money 
so as to maximize his return, then his rate of profit must be the same on all his 
investments. 
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Proofs by elimination are based on showing that the law to be proved 
is one term of a disjuction that is a law and each of whose other terms is 
false. It is based on such laws as the following. 


(48) lp V gq) A ~q] > Pp, 
(49) (pVqVrAnxqA οὐ > Pp. 


(n) Argue for the plausibility of (48) and (49). (0) If one angle of a tri- 
angle is greater than another, the side opposite the larger angle is larger. To 
prove this, we note that the first side must be greater than, equal to, or less 
than the second. The side opposite the greater angle cannot be equal to the 
other, since if it were the two angles would be equal. It cannot be less, for then 
the opposite angle would be less instead of greater. (We assume it is known 
that if one side of a triangle is larger than another, the angle opposite the first 
is greater than the angle opposite the second.) Hence it must be greater. Ex- 
plain how (49) applies. (p) What other logical ideas are involved in this 
argument? 

{t would be possible ‘to formulate additional rules of proof based on the 
theorems of logic. For example, because of (6), we could say that if p A q 
is a law, then p is a law. Instead of formulating such rules, we refer directly to 
the theorems when they are needed. (4) Formulate a rule of proof corre- 
sponding to (2-5-28). (1) Do the same for (15). 


PROBLEMS 


1. Show that the converse of (3) is not a law. 
2. Is [(p V g) «Ὁ 7] © [6p - (Qe 7)] ἃ law? Is it a law if the central © 
is changed to —? Justify your answers. 
3. Rewrite (7) informally. 
*4. Explain (13) by giving a more complete proof. 
*5. Rewrite (13) informally. 
%*6. Discover and prove some laws involving V . 
7. Prove that (p «» 4) © (wp αὐ. 
8. Formulate a rule of proof corresponding to (3). 
9. Formulate rules of proof corresponding to other laws of implication. 

10. Prove (p — 4) > (qe p). 

11. Assuming that there is one and only one line passing through two different 
points, prove that two lines intersect in at most one point. 

12, A prime number is by definition a natural number greater than 1 with 
no factors other than itself and 1. Euclid proved that there is no greatest prime 
number in the following way. Suppose that there is a greatest prime. Let us 
call it N. Now consider all the prime numbers, 2, 3, 5, 7,..., N. Let us con- 
struct a number M equal to the product of all of these plus 1, that is, M = 
(2-3-5+7-11...N)+ 1. Now M is not divisible by any of these primes, 
since there is always a remainder of one after such a division. Hence M is a 
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prime, since if it had a factor other than itself or 1, this factor would have to 
be a prime or be divisible by a prime, and in either case M would be divisible 
by one of the primes. But obviously M > N, and hence Ν is not the largest 
prime! Explain the use of indirect argument in this proof. 

13: Show that the converse of (21) is not a law. 


ANSWERS TO EXERCISES 


(a) No; only sentences are significant substitutes for the variables in (8). 
(b) ~~p = p. Hence by (8) and Inf ~~p— p. (c) (po p) = 
(~pV p) = pV ~p = ~(~p A ~~p) = ~(~P A Pp) = ~(p A ο»Ἐ). (ἃ) 
(θ) (ῳ: 4) ={pA ~d) - vp) = σῷ A ~®@ V Bp lby (2-5-23)] = ~p V 
“ΟΜ ΡΞ ΡΝ @V vp =~pV PVQD=P7> PV. (ἢ When 
p is false and q true, it fails. (g) The substitution yields (¢ A p) — ᾳ, which 
is the same as (p A 4) ~ q. (h) They embody the schemata (2-6-6) and 
(2-6-26). (i) First and third only. (7) All. (k) (2-5-28). 

(Ὁ ~(a || δ) — [a meets ὃ in some point]. Call this point P. Then through 
P there are two lines Lc. But this is false, since only one line can be drawn 
from a given point perpendicular to a given line. (m) If his rates of profit 
are different, he can increase his return by transferring some resources to the 
more profitable investment, and hence his profit is not maximum. The con- 
clusion follows by (2-5-28). (0) (49)(p:first greater than second, q:first equals 
second, r:first less than second). (p) Each possibility is shown to be false by 
showing that it implies a false statement, i.e., by applying (44)(¢:~q). (ᾳ) To 
prove an implication it is sufficient to prove its contrapositive. (r) To prove 
a statement false, it is sufficient to prove that it implies a false statement. 


ANSWERS TO PROBLEMS 


1. (2 = 5) > (2 = 2). 3. We may assume 4 by hypothesis; also, p V ~p 
is a law by (4). Hence, in accordance with the Rule of Hypothesis, the desired 
conclusion appears as a step in a proof in which the hypothesis is assumed, 
and we have proved the implication. 7. (pg) = (Ὁ - 4) A (ᾳ -"»} = 
(~qg>~p) A (~p > ~@ = (~p ee ~g. Then use (21), 12. The main 
argument is justified by (43)(¢:There is no largest prime, p:N is the largest 
prime). To show M is prime we use (44)(q:M is prime, p:M is divisible by a 
prime). 


2-8 Manipulating algebraic equations. Much scientific discourse 15 
formulated in terms of equations. The following are some of the problems 
that’ arise. 

I. To prove that a given equation ts an identity. Numerous examples in 
logic and the algebra of numbers have already been given. 

II. To prove that a given equation is not an identity. This is often most 
easily done by citing a counterexample (a single example for which it is 
false). 
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11. Yo prove that a given value of the variable in an equation is a solution. 
The proof is accomplished by substituting in the equation and proving 
that the result is an identity. Hence this problem reduces to I. 

IV. To find some solutions of an equation. This is quite a different matter 
from III. Indeed, before we can try to prove that a number is a solution, 
we must find it! The difference between III and IV is very much like the 
difference between the jobs of a prosecuting attorney and a detective. 
A trial cannot be held without an accused! The prosecuting attorney 
(III) wishes to prove that the accused is guilty, and he is bound by very 
strict rules (rules of evidence, court procedure, etc.) He wishes to prove 
that the accused is a solution of the sentence “x committed the crime.” 
On the other hand, the detective (IV) wishes merely to find the guilty 
party. In this search he may guess, use his intuition, listen to gossip, and 
do all sorts of things that would not be considered proof by any court of 
law. Similarly, we may use any convenient method to find solutions of an 
equation, but we must not imagine that this is the same thing as proving 
that what we have found is indeed a solution. 

V. To find all solutions of an equation, i.e., to solve it. This involves 
more than III or IV. 

VI. To derive some equations from others. This problem is more general 
than I, since here we do not require that the equations be identities. It 
arises very frequently in science when the scientist assumes that certain 
relations hold among values of his variables and desires to discover the 
consequences of this assumption. 

To solve such problems we make use of manipulations of several kinds: 

A. Manipulations yielding synonymous sentences. If we replace an ex- 
pression in an equation by a synonym, the result is an equation synony- 
mous with the original. Moreover, by (2-7-21), the original and resulting 
equations are logically equivalent. Because of the equivalence, any value 
of the variable that is a solution of one is a solution of the other. If one 
is an identity, so is the other. If one is not, neither is the other. If we find 
all solutions of one, we have found all solutions of the other. 


(a) Why does an equation imply any equation found by a manipulation of 
Type A above? 


For example, to prove that —2 is a solution of x? + x — 6 = 0, we 
may write 


(1) 274+2—-6=0644+2-6=0+0= 0. 
Here each change involves a replacement. Since the last equation is an 
identity, so is the first. 

(b) Prove that —3 is a solution of (1). 
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B. Manipulations yielding logically equivalent sentences. These include 
not only those of Type A but also other manipulations that yield sentences 
that are equivalent without being synonymous. For example, 


(2) 2x -- 1-| «» 2.) = Il, 


because 22 --ἔ 1--Ξ 0 -» 25 - 1 by (2-6-16)(a:22 — 1, δ:0, 6:1), and 
2e τὸ 1 - 2x —1=0 by (2-6-16)(a:22, b:1, c:—1). Manipulations 
of this kind have the same advantages arising out of the equivalence as do 
those of Type A. 

The most useful laws that justify manipulations of this type are the 
following. | 


(3) (a=bbeo(fatec=b+o), 
(4) (c ¥ 0) > (a = beac = be), 
(5) (αὖ -Ξ- 0) «» (@a=O0V b= 0). 


The first law follows from (2-6--16), (2-6-28), and (2-3-17). To prove 
the second, we note that a = ὃ -- ac = be. Hence by (2-7-23)(p:a = 
ὃ — ac = be, gic ~ 0) and Inf, we may assertc ¥ 0 - (α = ὃ - ac = 
bc). From this and (2-6-24) we have (4) by using (2-7-38) and Inf. 

We can argue informally for (5) as follows. First to prove ab = 0 - 
a= 0 V b= 0, we assume ab = 0 by hypothesis. If a = 0, we have 
the conclusion. If not, (4) justifies multiplying both members of ab = 0 
by l/a to get ὃ = 0. Second, (α = 0 V ὃ = 0) — (ab = 0) follows 
immediately from (1-9-3830) and (1-9-31). Hence, by the definition of 
logical equivalence we have (5). 

To prove (5) more rigorously we use (2-7-39)(p:e = 0, g:a = ὃ, 
riac = bc) to get [le ¥0—- (α = beac = δ0)] A (ὁ ΞΞ 0 - a= 
bc)] — [ac = be «» (c = 0 V a= b)]. The hypothesis of this implication 
is a law, since its first term is just (4) and its second term follows im- 
mediately from (1-9-30). Hence by Inf, its conclusion is a law; that is, 


(6) (ac = bec) «-» (ες = 0 V a= δ). 
This law is sometimes useful as it stands. We can get (5) from it im- 
mediately by the substitution (6:0). 
We illustrate the use of these laws by the following manipulations. 
(7) QW? +2%= 12427 +27= 6 (4) 


(8) orvta«z—6=0 (3) 
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(9) © (x -- 83) — ὃ = 0 (1-12-7) 
(10) or+3=0V2r—2=0 (5) 
(11) er=—-3V2r=2 (3). 


Each sentence is logically equivalent to the preceding one for the reason 
given. Hence we have simultaneously solved problems III, IV, V for this 
equation. We know that —3 and 2 are roots and all the roots. 


(ὁ) Which steps in (7) through (11) involve replacements? (4) Which 
sentences are synonymous? (6) Solve 3 — 2x = 81 - 1 by such manipula- 
tions, giving reasons. (ἢ) Do the same for «* — x — 6 = 0 and for (g) 
Qn? + 824+ 8 = 


An equation of the form ax + b = 0 (where a and b do not involve z, 
and a # 0) is called a linear equation in one variable. Equations that can 
be reduced to this form by replacements and manipulations like those in 
(3) and (4) are also called linear. The solution of such an equation is trivial. 
(12) ax +b=Ocxar = —b 


(13) x = —bd/a. 


(h) Solve, giving reasons: 7/2 + 16 — 5a = --2(. - 8). 


An equation of the form ax? + bx + ὁ = 0 (where a = 0, and a, ὃ, 
and ¢ are independent of x) and equations that can be reduced to this form 
are called quadratic. Manipulations similar to (7) through (11) may be 
used to solve any quadratic equation. 


az? + ba +c=0 

(14) <> 2° + (b/a)x + (c/a) = 0 

(15) <+ “5 + (b/a)x + (b/2a)? — (b/2a)? + (c/a) = 0 

(16) «» (x + b/2a)? — (b?/4a? — c/a) = 0 

(17) «» (a + b/2a)? — (6? — 4ac)/4a? = 0 

(18) «(4+ b/2a+ bt (x + b/2a — Wb? — 4ac/2a) = 0 


(19) οα ἘΞ ὙὉΣ " Ά46ς τ νυ δ τὸ ν᾽ τὰς Gt = 0 
25 es AES a sop cll Ware 
(20) og = POE ey DV te, 


(1) Give the reasons for steps (14) through (17), (19), and (20). 


108 ELEMENTARY LOGIC [cHaP, 2 


Step (18) is obtained by factoring (17) as a difference of two squares, 
that is, by using 


(a? — b? = (a+ δ)ία — D)](a:z + b/2a, b:Vb? — 4ac/2a). 


However, if b? — 4ac < 0, Vb? — 4ac is not a real number and does not 
even exist in the number system we have used so far. (Why? For a proof, 
see Section 3-5.) Hence, for the present when b? — 4ac < 0 we say that 
the equation has no real roots. In Chapter 6 we extend the number system 
so that any number has a square root. Then (14) through (20) proves 
that any quadratic equation with b? — 4ac τέ 0 has two roots given by 
(20). When b? — 4ac = Ο there is only one root, but it is customary to 
say that the two roots are equal. We call b? — 4ac the discriminant of 
the quadratic. Usually (20) is written 


—b + Vb2 — 4ac 
2a ; 
and the expression giving the two values of z is called the quadratic formula. 


(j) Carry through (14) through (20) with 27+ 2z—4=0. Use the 
quadratic formula to find the roots of: (k) 27 — 25 —4 =0,and (I) 2 + 
6z2-+9 = 0. Solve: (m) 322 -ἰ ὅς — 1 = 0, and (n) 327 — 544+ 1 = 0. 


(21) αὐ +br +e =O00C2= 


C. Manipulations yielding a necessary or sufficient, but not a logically 
equivalent, sentence. For example, to solve x7 = x we may divide both 
members by x to get x = 1. According to (2-6-17)(a:2, b:1, c:z), x = 
1— 2x? = x. Hence 1 is a root of +? = x. But the converse is false; 
that is, 42 = , ῳ — x = 1 is not a law. Indeed, its hypothesis is true and 
its conclusion false for x = 0. This manipulation yielded a sufficient but 
not necessary condition for the original equation, and one root was lost. 
Of course, (6) yields 


(22) g?=—=azox#=O0V2e=)). 


Or we could solve the equation by rewriting it as “7 — x = 0 and then 
factoring or using the quadratic formula. 

As a second example, consider the equation 1/z + 3 = 1/zx. Multi- 
plying both members by xz, we have 1+ 34 = 1, 3u = 0, x = (. But 
0 is not a root of the original, since “1/0” is undefined. The difficulty arose 
because 1... + 3 = 1/r ~1+ 3x = 1, but the converse is false. 
Hence we gained a root by this manipulation. Here we found a necessary 
but not sufficient condition. Again the difficulty could be avoided by 
sticking to manipulations of Type B. By (4) we have 


(23) 2¥#0-> (/e+3 = 1,0 451 -Ὁ 85 = 1). 
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And since 0 is the only solution of the right member of the equivalence, 
there is no root. 


(0) Show directly that 1/z + 3 = 1/z has no solution by showing it logically 
equivalent to 3 = 0. 


The above examples make clear the risks in manipulations of this type 
when solving equations. As long as we stick to manipulations of Type B 
we neither gain nor lose roots, but those of Type C may do either. How- 
ever, manipulations of Type C are convenient in some situations, pro~ 
vided they are used carefully. 

For example, to solve the equation /z + 1 = /4z, we may use 
(2-6-22) to write 


(24) Vetl= VY4e>r+2/e+1= 41 

(25) — 2\/x = 82 — 1 

(26) -ῷ 47 = θυ — 6r + 1 
(27) — 92? — 10r +1=0 
(28) — (θα -- 1) — 1) = 0 
(29) ~xz= 1/99 Vr=1. 


Since the converse of (2-6-22) is not a law, we cannot use equivalence 
signs here. The implication in (29) means that if x satisfies the original 
equation, it must equal one of the values given by (29). But it does not 
follow that if x equals one of the values given by (29), then it must sat- 
isfy (24). Indeed, substitution in the original equation shows that 1 is a 
root but 1/9 is not. Here this procedure caused no error, since our manipu- 
lations did not lose any roots and we tested the alleged roots it yielded. 


(p) Verify the statements about 1 and 1/9. (q) Justify (24) through (29). 


As an example of the usefulness of manipulations of Type C in deriving 
some equations from others in science, we consider the following equations 
taken from an article in the Autumn, 1956, issue of Educational and 
Psychological Measurements. 


(30) AL, = 4s— As) + (As — Az) + (As — 42) (As — Ad) 
L. t 


(31) Ace = Ai + Ag + Ag + A4 + As, 
(32) AL. = (545 — Acc)/4. 


The authors say that (30) and (31) imply (32). Now by replacements we 


i= 
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find that (30) is equivalent to 

(33) A.L. = (445 — (Ay + Ag + Az + 4.4))Λ4. 

Also, (31) is equivalent to 

(34) Ace — As = Ay + Ag+ 43 Ὁ Ag. 


Now we make in (33) a replacement of the right member of (34) by its 
left member. After slight simplification, the result is (82) as desired. We 
see that by assuming (30) and (31) we can derive (32). Hence [(30) A 
(31)] — (32) by the Rule of Hypothesis. But the converse certainly does 
not hold. 


(r) Carry through all the details required to verify the above argument. 
Show that the converse fails by finding numerical values for which (32) holds 
but at least one of (30) and (31) is false. (8) The following equations are 
taken from an article in the Spring, 1955, issue of Educational and Psychological 
Measurements. | 


(35) R; = (1 — ΟἹ; + en, 
(36) R;+ WwW; =n, 
(37) T; = Ry — | “. W;. 


Derive (37) from (35) and (36). 


The discussion of this section refers to equations. However, the general 
comments under I through VI and A through C apply more generally to 
any sentences. Examples will be given in later sections. 


PROBLEMS 


1. Prove that (27 + y?)(a§-+ ys) = (aite — yiy2)? + (tiy2+ x21)" is 
an identity. 
2. Prove that a2 = b πο a = V6 is not a law. 
3. Prove that 1/a is a solution of x? + 1/2? = a? + 1/a’. 
4. Find three other solutions of the equation in Problem 3. 
5. Find all the solutions of x(x? — 1)(x + 2)(52 — 82 — 4) = 0. 
6 
7 


— then P = p+ (1/3)(1 — P). 


3P 
. Show that if p = 
. Prove 
(38) (a2 = 035) «» (a = ὃν a = —D). 


8. Prove (a # 0 A ὃ # 0) -» (ab # 0). 
9, Prove (a = 0 A ab = 0) > (6 = 0). 
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10. Sometimes people try to prove an identity by showing that it im- 
plies a true statement. Thus, in place of (1) they write 22- 2 --ἰ6 = 
0—-4+2—6=0-—0 = 0. What is the fallacy in this argument? 

11. Find the fallacy in the following: Let 2 = y. Then 4 = 2y or 4 — 2y = 0. 
Factoring, we find 2(2 — y) = 0. Dividing by 2 — y, we find 2 = 0(1). 


12. Solve “72 — 2/3 = 4. 13. Solve z/3 — (1 — 2x) = 72 - 9. 
14. Solve az - ὃ = cx +d. 15. Solve z? + z—1 = 0. 
16. Solve x? + ajx + ap = 0. 17. Solve 52? — x — 1 = 0. 
*18. Prove by substitution that the quadratic formula gives roots of the 
quadratic. 


*19. Find (21) by the change of variable x = y — b/2ain ax? - br +c = 0, 
solving the result for y, and substituting back for zx. 
* 20. Show that if 71 and zg are the roots of 77+ ατῷ - a9 = 0, thenr; + re = 
—a, and 7179 = ao. 

21. Show [ἡ = (N — 24)/2m1 A ae = 21 +h] - [xe = (N + 2?)/2z24). 

22. From the conclusion of Problem 21 and x3 = (N+ 2%)/2ze, find a 
formula for x3 in terms of 2; and N. 

23. (This and the next problem are taken from articles in Educational and 
Psychological Measurements, Summer, 1956.) Show that 


U(n — L) —L(n— U) _ eur 
n? 7 n 
24. Solve for 712: 
1 -- (rie + ris + r33) + 2rierisre3 = 0. 
25. (This and Problem 26 are taken from The Geometry of International 
Trade, by J. ἘΠ. Meade.) (a+ ὃ = c) — [a/b = 1/(c/a — 1)]. 
26. [e’ = E/Q At = I/eQ+ BE) -- [i = (1/e)(I/Q)/ +e’) A (1/e)(1/Q) = 


a(1 - 67]. 
27. Solve for zx: 


2x —a be — cx 


= —___ fora = 2,6 = —l,c = 8. 
b a 
28. Solve for z: 
ὃς οὗν 
x—-a=—+— fora = —3,b = 0,c = —2,d = —2,e = 4. 
d de 
29. Solve for z: 
a — I 1 1 
cS ae ge ςςς a for θα. - 7 = 0. 


— re eg 
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31. Show that [(—a)(—b) = —ab] > (—a = a). 
32. Show that if [Pb++] = [HSOq7]-+ [501], [Pbt*][SOq] = KSP, and 
= + 
[H+[SO7] = Ke[HSO7], then [Pb*t+]? = “Sot KSP. (Note: The 
brackets stand for concentrations of the chemical named within.) 
33. (This and Problems 34 and 35 are taken from Mathematical Introduction 
to Economics, by G. C. Evans.) Solve for u: 2u/a — b/a — 2Au — B = 0. 


Ὁ - Ba 
34, Solve for p: ae μα ap + ὃ. 
ps _ 
35. Solve for p: sar ap + 6. 


36. (This and the next problem are taken from Mathematical Biology of 
Social Behavior, by N. Rashevsky, pp. 44 and 139.) Solve for 21 -+ 22: 


αι + ag — 2δι(αι + 12) = 2box, + 2222. 


37. Find «if 1/(14 2) — B = 0. 

38. Show that —4(a — 1/2)(y — 1/4) + 5/2 = —4ay-+ 2+ 2y+ 2. (1η- 
troduction to the Theory of Games, by J. C. C. McKinsey, p. 22) 

39. Show that if Ay = y1 — yo and Az = x1 — Zo, then 


(x — xo) Ay _ 
χ τι — 70 


yo + [χι — yo) + yori — xoyil. 
40. (From An Introduction to the History of Mathematics, by H. Eves, p. 184) 
Solve for x: 
V{(2/8)(7/4) Ba)? — ὅ2- 8 _ 


10 2. 


41. (From Statistical Methods, by G. W. Snedecor) Solve for 2: 


(a —az)(d — 2) _ 


(b+ “)ίς + 2) 
42. Show that 


43. Show that 


p=q—-—-1AP=Q-1 
2 2 
ON NYS it ON NY τ, 
pN QN PQ 
44. Show that 
; ; oS aad _ Iplog (£/E’) 
jd — f(A d) = log => d = αἱ : +4]. 


(Note: You do not need to know the meaning of log (E/E’) to solve this 
problem.) | 
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ANSWERS TO EXERCISES 


(a) (p = 4) > (pq). (b) (—3)?+ (—3) —-6 = 009-3 --ἰδᾷα = 
0-0=0. (c) Q)only. (d) (8) = (9). (6) 3 — 2z = Br +10 — 22 = 
3a — 24» —5r = —2 <> 2 = 2/5. By adding —3 to both members, adding 
—3x to both members, dividing both members by —5, and simplifying. See 
Sections 1-9 and 1-14. (ἢ 2 ~zx—6 Ξ 09» (¢ — 3)(rx + 2) = 00 
e—-38=0V2+2=0027=3V 2 = -2. (5) 2? +44+4=0¢ 
(c+ 2)? = 0<>x2 = —2.  (h) Cheek solution by substitution. See Sec- 
tions 1-9 and 1-14. 

(1) (14):(4), (15):(1-9-20) and (1-9-22), (16):(1-12~9), (17):(1-14-2) and 
(1-14-3), (19):(5), (20):(3). (j) Instead of substituting (a:1, 6:2, c:—4) 
throughout, carry through each step. (k) « = (24+ V/20)/2 =1+4 V5. 
Note that by (1-13-22), V20 = VW4+5 = VW4./5 = 2V5. (1) Double root, 
3. (m) (—5 + V37)/6. ( (5 + V37)/6. (0) Add —1/z to both 
sides. (p) By substitution. (q) (24) and (26) by (2-6-19). (s) Replace n 
in (35) from (36) and solve for Τὶ. 


ANSWERS TO PROBLEMS 


1. Expand each member. 2. (a:—1). 4. a, —a, —l/a. 5. 0, £1, —2, 4, 
—1. 6. Solve for P and rearrange. 7. a? = b? <> a? — 0? = 06 (a+ 5) 
(a — ὃ) = 0. 10. (2-6-27). 11. Division by 2 — y is division by 0. 12. Check 
alleged solutions by substitution. 14. (ὦ — b)/(a — cc). 15. (—1 4 VW5)/2. 
17. (1 + V21)/10. 


2-9 The functional notation. It is often convenient to have an abbre- 
viation for a formula that appears several times in a discussion. For ex- 
ample, we might wish to talk about x? — x -+ 1 and its values for various 
values of x. To avoid having to write out the formula each time we wish 
to mention it, we could adopt a single letter as an abbreviation. Thus we 
might let s = x? — x+ 1. However, if we wish to indicate what hap- 
pens when substitutions are made in a formula, a single letter is unsatis- 
factory because it does not involve the variable explicitly. The difficulty 
is overcome by adopting an abbreviation in which x appears. We choose 
some letter, such as f, and follow it by the variable in parentheses, as in 
f(x)(read “f of x”). Then we adopt a temporary definition, such as 


(1) Let f(z) = 2? — 1 -Ε 1. 
Now (1) says that f(z) = 2? — x+ 1 is an identity by definition. 


Hence we get identities from it by any significant substitution for Ζ. We 
must, of course, substitute for x throughout. Thus 
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(3) fa) =a? ~a+], 

(4) IQ = -- aL 

(5) fety=@et+y’? -- ὦ - ψ) -ἘῚ, 
(θ) fw?) = (ἢ) -- 2? +1, 

(7) f(x?) = (5) — 23 +1, 

(83) fi? +1) = @?41)?-@+)41 


In this way we can use the notation to indicate any substitution in the 
original formula. Thus f(2) means the value of f(z) when xz = 2; that is, 
f(2) = [f(x)|(:2). Indeed, for any value of a, 


(9) f(a) = [f(@) (via). 


We read f(x) as “f of x.” It does not mean “f times x.” Multiplication 
is not involved in any way. It means the value of a certain formula cor- 
responding to the value of x. To find f(a), where a is any significant sub- 
stitute whatever for x, we simply substitute a, however complicated it 
may be, into the formula for which f(x) has been adopted as an abbrevia- 
tion. Thus if we let f(x) = [x = 3], then f(2) = [2 = 3], f(3) = [3 = 3], 
f(a? + 2) = [a? +2 = 3], and so on. 


(a) Let f(z) = x —1. Find f(1), f(2), f(—3), FO), f@ — 1), fle + 2), 
fic+4),flatb+ec). (Ὁ) Let Nv) = ~x. Find N(p), N(~p), N(N(p)), 
N(p V q), N(p A 4). Comment on ‘N(7).” 


How can the reader know whether an expression such as s(x) means “s 
times x” or “s of x”? He has to tell by the context. However, he is aided 
by the following conventions: (1) Ordinarily letters near the center of the 
alphabet are used in the way explained in this section, especially f, g, h, 
F, G, H. (2) When multiplication is meant, and confusion might occur, 
we write s- 2, (s)(x), or sx. Hence, in the absence of indications to the 
contrary, s(x) stands for “s of x.” 


Find f(—1) for: (0) f(z) = («— 1)”, (ἀ) f(z) = --, (0) f(x) = 2’, 
(ἢ f(z) =a+ty, (ge) f(z) = x3. (h) Argue that if n(x) = —z, then 
n[n(x)] = 2. 

If F(x) = the father of z, F[F(x)] = the father of the father of z = the 
paternal grandfather of x. Let M(x) = the mother of ᾧ. Find: (1) M[M(2)], 
4) M[F(@)], (Ὁ F(FIF(2)). 


Reading “f-of-x” for f(x) may seem strange, but it is a natural outcome 
of ordinary ways of speaking. Thus, suppose we let f(z) = x” = the 
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square of x. Here f takes the place of “the square” and the parentheses 
indicate “of.” Similarly, in the last exercise F stands for “the father.” 
We shall discuss the meaning of f in f(x) more fully in Chapter 5. For 
the present the reader may think of it as standing for the relation between 
the values of x and the corresponding values of the formula. 

The functional notation is very widely used in mathematics, and for 
this reason it is worth while to develop some skill with it. It is very simple 
to use if we remember simply that whatever appears within the paren- 
theses after f in f( ) is to be substituted for x in the formula for which 
f(x) is an abbreviation. Then the result is to be interpreted as the value 
of the formula corresponding to this value of x. If α is a constant and the 
formula contains no other variables, f(a) is a constant synonymous with 
the constant resulting from substituting @ for x in the formula. For ex- 
ample, if f(z) = x? and a = 2, f(a) = f(2) = 4. If a is a variable or 
the formula involves other variables, f(a) is a formula synonymous with 
the new formula obtained by substituting a for x. For example, if f(z) = 
2y + x, f(2) = Ὧν + 2 and f(x?) = 2y + x”. 


Let f(z) = 8) - Ἑ 2 and g(x) = 1 -- ζ. Find: (1) f(4), (m) g(A4), 
(n) f(x - 2), (o) γί! — 2), (p) ff), (a): g(g(2)), (r) f(g(x)), 
(s) g(f(z)). 


Often the functional notation is used to stand for an unspecified formula. 
When f(x) is used in this way, both f and z are variables, whereas when 
f(x) has been temporarily defined, only x is a variable. Indeed we have 
been using the functional notation in this way, for we intended (9) to 
hold no matter what formula had been abbreviated by f(z). 


(Ὁ) How would you interpret p(x) — q(x)? What sort of formulas would 
be significant substitutes here for p(x) and q(x)? (u) Suggest a substitute for 
p(x) and g(x) for which the implication is a law. 


Later, when the functional notation is used to stand for any one of 
many formulas, we shall be able to make substitutions for the letter in 
front of the parentheses. For the present we treat “f(xz)” as though it 
were a variable, and we consider any term to be a significant substitute 
for “f(z)” and similar expressions. But we have as yet no constants that 
are significant substitutes for “f.” The reason we say that any term may 
substitute for “f(x)” is that we may wish to substitute a constant or an 
expression not involving “x.” For example, suppose we let f(z) = x — x. 
Since the right member is 0, we have f(x) = 0 for all values of “x.” This 
may seem strange, but it is often convenient to let “f(z)” stand for a 
constant. 


(v) Let f(z) = (ὦ — 1)? — x? - 22. Find f(3), f(7), f(0), f(—3). Show 
that f(z) = 1 for all values of zx. 


116 ELEMENTARY LOGIC [cHAP. 2 


The functional notation can easily be extended to deal with formulas 
in which more than one variable appears. We simply adopt some letter 
as before and place in parentheses after it a list of the variables in which 
we are interested. Thus we may let f(z, y) = x — y. Then we may as 
before make any significant substitutions throughout. For example, 
It is important, of course, to notice the order of the variables in the ab- 
breviation and place the substitutes in the same order. | 


Let h(x, y) = «+ 2y. Find: (w) h(2,3), (x) λία, ὃ), (5) A(—1, —2), 
(4) h(0, 0). 


PROBLEMS 


In Problems 1 through 3 let f(z) = 1 — x?, g(x) = 22. 
1. Find f(z) + 1, f(z + 1). 
2. Find g(x) — 3, g(x — 8). 
3. Find f(z) + fly), f(@ + y). 


4. State a conclusion from Problems 1 through 3 in terms of the notion of 
distributivity. 


Find f(f(x)) for f(x) defined as in 5 through 10. 


5. 2°. 6. —z. 7. “οἱ. 

8. 1/a. 9. 2x. 10. 3 — Ζ. 

Find f(g(z)) and g(f(x)) where f(x) and g(x) are defined as in 11 through 15. 
11. χΖ, να with x > 0. 12. 2222/2. 13. —x, —2. 

14. 2. — 1, (1 + 2)/2. 15. 1+ 1/2, 1/(@ — 1). 


In Problems 16 through 26 let T(x) = the truth value of z. 


*16. What are the significant values of x? 

*%17. What are the values of T(x)? 

*18. Show that Τῷ A gq) = T(p): Tq. 

W119. Τί») = 1 — T(p). 

*20. T(p V q) = the larger of T(p) and T(q) or either if they are equal. 
*21. Τί A 4) = the smaller of T(p) and T(q) or either if they are equal. 


*22. T(p V 4) = the smaller of ΤΩ) + T(q) and 1. 
*23. Tip > 4) = TIT(p) S TQ). 

Ἀ24. Tip<q) = TIT(p) = TQ). 

*25. Tip 4) = ΤΩ) + T@ — Tp): TO. 
*26. Tip V a) = |p — 4]. 


27. If f(z, y) = x — y, prove fly, z) = —f(x, y). 
28. Find some definitions of f(z, y) so that f(z, y) = fly, x). 


29. f(z, y) ΞΞ 52 ἂν Y τ LY. Find f(2, y); f(2, x), LY, x), f(z, f(z, y)). 
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30. Repeat Problem 29 for f(z, y) (y= 20), 
31. Repeat Problem 29 for f(x, y) = y/z. 


In Problems 32 through 35 let J(p,¢g) = p A gq, D(p,qg) = p V a, N(p) = 
~p,C(p,q) = p > 4. 
%32. Argue that D(p, ῳ) = D(q, p). 
* 33. Show that N[D(p, q)] = JIN(p), N(q)]. 
% 34. Define D and C in terms of J and N. 
*%35. State (2-7-23) and (2-7-38) using these symbols. 


I 


36. Ifh = heredity, ἡ = environment, and p = the personality, what would 
it signify to write p = f(h, 1). 

37. Ift = the tax rate of z, J = the income of x subject to tax, and 7 = the 
tax paid by z, what would we mean by T = g(t, J, x)? Would it be adequate 
to write T = g(t, I)? 

38. As the reader may have observed, we are almost always omitting the 
quotation marks called for by the discussion in Section 1-3. He may find it 
interesting to decide where quotation marks could properly be inserted, espe- 
cially in the present section. | 


ANSWERS TO EXERCISES 


f@—1) = ὦ --ὀ Ὁ —1,fe+2) = @+2)-—1fe+4 = 5. 2 -- Ἰ, 
flato+c) Ξ ((-Ἐδ- ὦ --᾿᾿. (Ὁ) ο, Ὁ Ρρ, ἡ ριον 4), μῷ Δ φ.. 
“N(7)” is nonsense. (0) (~1 --- 1)32, (4) —(—1). (0) (—1)2. (f) -1-Ἐ y. 
(g) (—1)%. (δ) n[n(z)] = n(—x) = --(--) = x. (i) The maternal grand- 
mother of x. (7) The paternal grandmother of x. (Κ) The paternal great 
grandfather on the father’s side of x. 

(1) 14. (m)—3. (ἃ) 8.ὄ -ἰ 2)- 2. (0) -- ( -- 2. (p) 3(8¢+ 
2) -Γ 2. (4) 1 -- ἰὀἰἀ3ά -- α)ὴ. (Ὁ 8( -- 2) -Ὁ 2. (s) -- (8x4 2). (0) Some 
propositional formula implies some propositional formula; sentences. (u) Let 
p(x) = (x = 1), az) = (2% = 2). (Ὁ) All substitutes yield 1, since f(x) = 
α" — Qen+1—2?+ 22 = 1. (w) 8 (x) a+20. (y) —5. (2) 0. 


ANSWERS TO PROBLEMS 


LS Be ee) “2 oes. Gee 8) Se Se De 
Ι -- ᾧ -Ὁ ψ)2. 5.0%. 7. χ. 9. 4a. 11. 2. 18. 2. 15. 2 16. Propositions. 
17. QOand 1. 27. fly,z) =y— «a = —(@—y) = —f(z,y). 29. 4+ y -- 2Qy, 
4-+- 2 — 2x, γῆ τυ — ay, 227+ (2? + y — zy) — a(z? + y — ay). 31. 2/y, 
2/x, x/y, y/x”. 36. That h and i determine p, that is, that heredity and environ- 
ment determine the personality. 37. That the tax is determined by the tax 
rate, the total income, and the individual. Not if we wished to indicate 
explicitly all the factors determining a person’s tax. 38. This has been done in 
the paragraph preceding Exercise (v). 
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2-10 Quantifiers. As we have seen, a sentence may be true for all, 
some, or none of the significant values of its variables. If we wish to 
claim that a sentence is true for all values of its variables, we can simply 
display it as a theorem according to the convention explained in Section 
1-8. However, it is convenient to have special symbols to indicate how 
many of the values of a sentence are true propositions. We call such 
symbols quantifiers. 

We adopt the symbol Vz to mean “for all significant values of x.” Thus, 
Ve p = (for all significant values of 2, p). Usually we say “For all x, p” 
or “pis a law.” If f(x) is a sentence involving no variables other than z, 
then to claim that Vz f(x) is the same as to claim that f(z) is a law. For 
example, Vz 2? = x-x is true because x? = x-az is a law. Similarly, 
Vp (p — p) is true. We call V the universal quanizfier. 


State in words: (a) Vaf(z), (Ὁ) Ve~ f(x), (0) ~V2z f(x), 
(d) αὐ ~ f(x). 
True or false? (6) Vax —2x2 = 0, (ἢ Va(a+ 1)? = 27+ 1. 


i 


We adopt the symbol jz to mean “for some values of x” where “some” 
is understood to mean “one or more.” This meaning of “some” 1s universal 
in mathematics. In mathematics, some = at least one. We also read jz as 
“there exists an x such that.” For example, (dz 2” = 0) = (for some value 
of x, 2? = 0) = (there exists an x such that x7 = 0). We call 3 the 
existential quanitzfier. 


State in words: (g) Jzf(z), (h) 3a~ f(x), (Ὁ ~ 32 7(.), 
() ~ daz ~ 70). 

True or false? (k) 3:5 =2x+1, (ἢ 3χ χ ¥ 0. 

Express, using quantifiers: (m) ~~p = p is a law, (n) p> ὦ 1s 
sometimes true, (0) p A ~p is never true. 


We take Vx f(x) as undefined and define jz f(x). It is to be understood 
that significant substitutes for f(x) in Vx f(x) are only sentences or formulas 
standing for sentences. 


(1) Def. (ax f(x)] = ~Wx ~ f(z). 


This definition merely reflects the idea that to claim that f(x) is true 
for some xz is the same as to say that it is not false for all x! Several examples 
have appeared in the exercises above. 

Still further light is shed on the definition and the meaning of quantifiers 
by considering the special case when the quantified variable has only two 
significant values, say a and ὃ. Then Vz f(x) = [f(a) and f(6) are true] = 
[f(a) A f(b)], and 35 f(x) = [at least one of f(a) and f(6) is true] = [f(a) V 
f(b)|.. Then νῷ ~ f(z) = ~[ ~ f(a) A ~f(b)]. But the right member, 
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by De Morgan’s law, is synonymous with f(a) V f(b), that is, with 
dx f(z). Hence (1) is verified in this case. 
The following theorems are easily derived. 


(2) ~lvx f(x)] = 30. ~ f(x), 
(3) ~[ax f(x)] = Vr ~ f(z). 


(p) Illustrate (1) through (8) for f(z): 2? > 0, and for (4) f(z):2 A οἱ. 
(r) Give reasons in the following proofs of (2): 32 ~ f(z) = ~ Vaz ~ (~f(z)) = 
~ Vx f(x), and (3): ~ 3. f(z) = ~(-V2 ~ f(x)) = Vx ~ f(x). 


We now state an axiom that embodies the essential property of the 
universal quantifier. 


(4) Ax. [Va f(x)] > fy). 


In words, if a sentence is true for all values of z, it is true for any chosen 
value. In (4) we may substitute for f(z) any sentence involving z, and 
for y any constant that is a significant substitute for x in f(x). For ex- 
ample, if we make the substitution [f(x):r/z = 1, y:2], we have 
[Va a/x = 1] > 2/2 = 1. 

The axiom (4) embodies formally the Rule of Substitution, because it 
says that if a sentence is a law, then any sentence obtained by a significant 
substitution 15 also a law. It permits us to “drop” the universal quantifier. 
Our convention about displaying laws, discussed in Section 1-8, simply 
means that we write laws in a form ready for substitution. If we did not 
have such a convention, we should have quantifiers in front of all laws 
and be obliged to apply (4) whenever we wished to substitute. If 
we did not have the convention of Section 1-8, we should write (4) as 


Vy [Vx f(x)] > fy). 
(5) f(y) — [Bx f(x)]. 


Theorem (5) says that if a sentence is true for a particular value of 
its variable, it is true for some value. With all quantifiers shown, it is 
Vy [f(y) — [42f(x)]]. It is proved by the substitution (f(x):~f(x)) in (4) 
and then the use of (p — 4) = (ὡς — ~p). 


(s) Write out the proof just outlined. (0) Treat (2) through (5) in the case 
of only two significant values of 1. 


A theorem of the form Jz Ὁ is called an existence theorem. By (5), to 
prove dz f(x), it is sufficient to exhibit a significant value of x, say y, such 
that f(y) is true. Proofs of this kind are called constructive. For example, 
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(6) χα - ὦ Ξξ a. 


Proof: 
(aja +0=a (1-9-22), 
(b)a+O0=a—-idra+ur=a Sub, (5) (f(x): a + 2, y:0), 


(c) Jraa+z=—a Inf, (b), (a). 


(u) Similarly prove that Jra-x = a. 


Another way in which existence theorems are sometimes proved is to 
assume the contrary and derive a contradiction. Because of (1), to prove 
Jz p, one may show that ~(Vx ~ p), that is, that Vr ~ p is false. Such 
indirect existence proofs are called nonconstructive. 


(v) Prove (6) using indirect proof to establish ~ Vra - x γέ a. 
(7) ~fly) > [~Ve f(x)], 
(8) [Vax f(x)] — [Bx f(x)]. 


(w) Prove (7) and (8). 


If we know a law of the form 32 f(x), we know that f(x) for some value 
of x, but we may not know any particular solution. However, it would 
seem legitimate to adopt some name for the unknown value (or for one of 
the unknown values) of z, provided we do not assume anything about 
this value except that it is a solution of f(z). For example, if we know 
that [3x x has run the mile race in less than four minutes], we may draw 
some conclusions about whoever has done so without knowing who it 1s. 
In carrying on such a discussion it would be convenient to say: Let A be 
a person such that A has run the mile race in less than four minutes. 
Then we might say some things about A, such as that he must have been | 
in good physical condition at the time. These considerations lead us to 
adopt the following additional rule of proof. 


Rue or Cuotce: If a law asserts the existence of a solu- 
tion of a sentence ‘‘f(x),’”’ then “f(c)” may appear as a step in a 
proof, provided “‘c” is treated as a constant and no other as- 
sumption ts made about 2. 


(9) 


Like the Rule of Hypothesis, this rule could be omitted at the cost of 
lengthening and complicating proofs considerably. It is merely a con- 
venience, and any proof using it could be replaced by one rene only on 
the previous rules. We shall see later how it is used. 
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PROBLEMS 
Prove or disprove 1 through 6. 
1. Ve2r/e = 1. 2. Ixa/x = 1. 
3. Vz ~ (x? = 21). 4. Jax = 21. 
δ. w dea? Axe. 6. Janz? = α "1. 


Express 7 through 12 in ordinary language. 


Express 13 through 18 in symbols two ways, one using 3 and the other V. 


13. x = x-+ 1 is false for all z. 

14. x* = z is not always false. 

15. «* = x is sometimes true. 

16. “All plant and animal tissues are made up of units known as cells... . 
All cells contain a living substance known as protoplasm.” (Harvey E. Stork, 
Studies in Plant Life, 1945, p. 19) 

17. [1] = 2 is sometimes false. 

18. There is an x such that for all y y/y = xz. 


19. Complete the following verbalizations of (2) and (3): (2): To say that 
it is false that f(x) is true for all z is to say that there is some x for which f(z) 
ig ___._.. (3): To say that 5. that _____ is true for some x 
is to say that f(z) is _______ ἴυ.. ς᾽ Ζ. 

20. Write out (1) through (7) for the case of three values of xz: a, ὃ, and c. 

21. What is the scope of the first ~ in (1)? of Vz? 

22. Would (1) mean anything different if we substituted y for z in only the 
left member? in both members? 

23. What theorem in this section expresses the idea that to prove that a 
sentence is not a law it is sufficient to cite a counterexample? 

24. Which theorem expresses the idea that in order to prove that a sentence 
is sometimes true it is sufficient to cite a case in which it is true? 

25. Prove that it is false that all natural sciences are based primarily on 
controlled experiment. 

26. Prove ~ Vx ( < 1,000,000). 

27. Prove Val/a 5έ 0. 


Quantifiers are helpful in clarifying the meaning of statements involving 
“all,” “some,” and “none.” For example, 


(10) All squares are positive = Vrz? > 0 = ~ (10’). 
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(10’) Not all squares are positive = ~ Vrz? > 0 = ~ (10) 


3. ~ (x? > 0) 


= Some squares are not positive. 


(11) Some squares are positive = ira? > 0 = ~ (11’). 


(11) No squares are positive = ~ 3: 52 > 0 = ~ (11) 


= Va ~ (x2 > 0) 


All squares are not positive. 


Often the last form in (11’) is used carelessly when (10’) is meant. Such usage 
is incorrect (see H. W. Fowler, A Dictionary of Modern English Usage, under) 
“not”). To avoid confusion it is best not to use the last form in (11’). 


28. Justify the identities (10) through (11’). 

29. Which of the sentences within the identities (10) through (11’) are true? 

30. Repeat (10) through (11’) with the sentence “All men are mortal” 
(Vz xz is aman — Ζ 15 mortal). 


Taking the negation of a quantified statement is easy if one can express it 
with quantifiers as above and then make use of laws of quantification. It should 
be noted that because of (1) through (3) and the flexibility of the spoken 
language, there are many ways of stating each idea. The solution to any con- 
fusion may be found by translating into mathematical terms, re-expressing, 
and translating back, being sure at each translation that the meaning 1s un- 
changed. Express in normal English in two ways the negation of each of 31 
through 34. 


31. All Negroes favor an end to segregation. 

32. Some Negroes favor an end to segregation. 
33. No Negroes favor an end to segregation. 

34. Not all Negroes favor an end to segregation. 


Express 35 through 43 symbolically. Then express the negation in normal 
English in two ways. 


35. Nothing is good enough for him. 

36. Not all white people favor segregation. 
37. He is always happy. 

38. He is not always happy. 

39. He is always not happy. 

40. He is sometimes happy. 

41. He is sometimes not happy. 

42. “All that glisters is not gold.” 

43. There’s nothing new under the sun. 


The following laws express formally the meaning of equality. 
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y] <> Vf (f(z) = f(y], 
y] «Ὁ Vf [f(z) - fty)]. 


(12) [x 


(13) [x 


44, What rule of proof corresponds to (12)? 
%45. Prove that the left member of (12) implies the right. 
* 46. Prove the converse by letting f(z) = z. 
*47. Why do these two results prove (12)? 
* 48. Prove (18). 
49, Write out the laws of quantification for the case of four values of z: 
a, ὃ, c, and d. 
* 50. Justify the following: 


(14) Ax. [Va f(x) A g(x)] = [να f(x)] A ive g(x)], 
(15) [ax f(z) V g(z)] = [ax f(x)] V [351 g(2)], 
(16) Ax. [Va f(x) > g(x)] > [(va f(x)) > (νὰ g(x))], 
(17) [va f(x) > g(x)] — [πὸ f(x)] — [ax g(a)]}. 


51. Show that (14) and (15) are false if V and A are interchanged. 


ANSWERS TO EXERCISES 


(a) [For all x, f(x)] = [Every significant value of z is a solution of f(x)] 
[f(z) is always true] = [f(x) is a law]. (Ὁ) [For all z, not-f(x)] = [f(z) is 
always false] = [No value of x is a solution of f(x)] = [f(x) is never true] = 
[For no z, 7(4)). (ὁ) [For not all 2, f(x)] = [Not every significant value of x 
is a solution of f(x)] = [f(z) is not always true] = [f(x) is notalaw]. (ἃ) [For 
not all x, ~f(x)] = [Not every significant value of x fails to be a solution of 
f(x)] = [f(x) is not always false] = [the negation of f(x) is not a law] = [For 
some 2, f(z)]. (6) 1. (f) 0, since the sentence is false for x = 1. 

(g) [For some 2, f(x)] = [At least one significant value of z is a solution of 
f(x)] = [f(x) is sometimes true] = [There exists an x such that f(x)] = [~f(z) 
is not a law] = Exercise (ἃ). (8) [For some z, not-f(x)] = [Some significant 
value of x is not a solution of f(x)}] = [f(x) is sometimes false] = [There exists 
an x such that ~f(z)] = Exercise (c). (i) [For no zg, f(x)] = [No significant 
value of z is a solution of f(x)] = [There exists no x such that f(x)] = Exercise 
(0). (7) [For no z, not-f(z)] = [No significant value of x fails to be a solution 
of f(z)] = [There exists no x such that f(z) is false] = Exercise (a). (Κ) 0. 
(1) 1. 

(m) Vp~~p = p. (0) 3ρ᾽ -Ὁ “Ῥ. (0) “ΞΡ»ΛΡρ orVp~ (pA ~p). 
(p) (For some x, x7 > 0) = (It is false that for all z, 22 < 0). (It is false 
that for all xz, σῷ > 0) = (For some z, 2? < 0). (There is no x such that 
τ > 0) = (For all z, x2 < 0)ὲ. (r) (1) and ~w~p = p. (8) f(y) > 
~l~fy)] — αἰνὰ ~ f(x)] — 3. f(z) by (2-5-6), (2-7-8), (4), (2-5-28), (1). 
(t) (2), (8), (4), (5) become (2-5-7), (2-5-8), (2-7-6), and (2-7-9). 
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The results of Exercise (t) should not be surprising. The universal quantifier 
asserts the conjunction of all the values of the formula that follows it, since 
Va f(z) means that every value of f(x) is true. Hence Vz f(x) = f(a) A f(d) 
A f(c) A ..., where the values of x are a, ὃ, c,.... Similarly Jz f(x) = 
f(a) V Ff) V fle) .... 

(u) (1-9-25). (v) Wea+2 # a) - (Wea # 0) — (0 ¥ 0); by (2-5-28), 
(0 = 0) —~ (~Vrata xa). (w) (7) = (4) by (2-5-28); (8) follows from 
(4), (5), and (2-7-12). 


ANSWERS TO PROBLEMS 


1. True. Zero is not a value of z here. 3. False. 22 = 2-2. 5. True, since 
Va ~ (2? ~*~ x:2). 7. There is a number equal to its own negative. 9. Not 
all numbers are equal to their absolute values. 11. If the squares of two num- 
bers are equal, the numbers are equal or one is the negative of the other. 
18. Vex Aatli~wdre τ 21- 1. 15. 3χ 12 =2,~ νὰ 22 τέ χ. 17. 31 [1] ¥ 
t;~Vx\|z| = 2. 19. False; false; f(x); false; all. 21. [vx ~ f(x)]; ~f(2). 

28. Use the theorems with (f(x):z2 > 0). 29. x? = 0 for x = 0; otherwise 
z* > 0. 30. All men are mortal = ~ (Not all men are mortal) = ~ (Some 
men are not mortal). Some men are mortal = ~ (No men are mortal). 31. Not 
all Negroes favor an end to segregation = Some Negroes do not favor an end 
to segregation. 32. No Negroes favor an end to segregation = All Negroes 
are against an end to segregation. 33. Some Negroes favor an end to segrega- 
tion = Not all Negroes do not favor an end to segregation. 34. All Negroes 
favor an end to segregation = No Negroes do not favor an end to segregation. 
35. ~ Jzxzis goodenough. 37. Vi Heis happy at timet. 42. ~ νὰ 2 glisters --ῦ 
zis gold. (¥Vzrz glisters — x is not gold). This is an example of improper 
usage. 44. Replacement. 


*2-11 Multiple quantification. The most useful applications of quan- 
tifiers occur when more than one variable is quantified. We have the 
following possibilities for two variables. 


(1) Vy Vz... = For every y, for every z.... 

(2) Vz Vy... = For every x, for everyy.... 

(3) Vx Jy... = For every x there exists a y such that.... 

(4) Vy Jz... = For every y there exists an x such that.... 

(5) Jy Vx...—= There exists a y such that for every z.... 

(6) 3. Vy... = There exists an x, such that for every y.... 

(7) dx Jy... = There exists an x such that there exists a y such that . . 


(8) AWy Jv... = There exists a y such that there exists an x such that... 
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Read (1) through (8) with each of the following in place of the dots, and 
decide on truth values: (a) x+y = 22, (Ὁ) zy = x. 


Exercises (a) and (b) suggest that (1) = (2) and (7) = (8), so that the 
order of quantification is immaterial when only like quantifiers are in- 
volved. But clearly no two of (3) through (6) are synonymous, so that 
unlike quantifiers are not commutative. 


(9) Ax. ἵνα Vy f(z, y)] = [Vy V2 f(z, y)], 
(10) [ax dy f(z, ν}} = [By 35 f(a, y)). 


To avoid unnecessary parentheses, we follow the convention that the 
scope of a quantifier is the entire following expression or extends to the 
end of the smallest parentheses within which it lies. This convention is 
illustrated in (1) through (8) where, for example, Vy Vx... = Vy[Vz.. 1, 
in (2-10-1) where the right member means ~[Vz[~f(zx)]], and in (2-10-8) 
where the parentheses are essential. Because (1) = (2) and (7) = (8) 
we usually read Vx Vy as “for all x and y” and Jz Jy as “for some xz and 


y.” We also adopt the following abbreviations. 
(11) Def. Vay f(x,y) = Va Vy f(a, y), 
(12) Def. dz,y f(x, y) = Ax dy f(a, y). 


It is easy to extend the laws on negation to two quantifiers by simply 
applying (2-10-2) and (2-10-3) carefully. 


(13) [~ Va Vy f(x, y)] = [Ax dy ~ f(z, ψ)], 
(14) [~ Vax dy f(x, y)] = [Ax Vy ~ f(z, y)], 
(15) [~ ax Vy f(x, y)] = [νὰ dy ~ f(x, ψ)], 
(16) [~ 3. ay f(x, y)] = [Vx Vy ~ f(a, y)). 


(c) Prove (13) through (16). (4) Negate each statement made in Exercise 
(a) and compare truth values. 


Numerous other quantifiers could be defined, but only one is in general 
use. It 1s 312, which is read “for one and only one value of x” or “for just 
one x.” 


(17) Def. [A!z f(x)] = [3x f(x)] A Vay [F(2) A fly)) > x = yl, 
(18) [d!a f(x)] — [32x f(2x)], 
(19) [~ dla f(x)] = {νὰ ~ f(x)] V ayy (f(z) A fly) A y # x}}. 
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(e) Verbalize (17) through (19) and argue for them informally. (f) Discuss 
3j!z f(x) for the case where x has only two significant values and the case where 
it has only three. (g) Prove (18) and (19). (ἢ) Prove νν 3}. γ - 2 = y. 
(i) Does 3! commute with V or 3? 


A variable that is within the scope of a quantifier is said to be quantzfied. 
From the discussion in this and the previous section the reader is already 
aware that some care is necessary when dealing with quantified variables. 
Consider, for example, 


(20) 3. Vy x dislikes y, 


which says that there exists an x such that for all y, x dislikes y; that is, 
there exists a person who dislikes everybody. Now if we substitute x for 
y we have 


(21) Jz Vx x dislikes z, 


which says that there exists an 2 such that for all z, x dislikes x. If it 
makes sense at all, it certainly means something different from (20). On 
the other hand, if we substitute z for y in (20), we get 


(22) Jz Vz x dislikes z. 


This evidently has the same meaning as (20). We see that for a quantified 
variable we may safely substitute another variable only if the latter does 
not already appear within the scope of the quantifier. 

Since Vz f(z) means that the formula f(x) is true for all values of the 
variable x, the substitution of a constant for x would result i nonsense. 
Similar remarks apply to J and 3!. We see then that we cannot substitute 
constants for quantified variables. Of course, if Vz f(x) is a law, we can 
substitute a constant for x in f(x) standing alone, but not in the whole 
expression Vx f(x). For similar reasons it is not convenient to substitute 
formulas for quantified variables. 

If we recall the definition of “variable” in Section 1-3, we see that in the 
context Vx f(z), x is not a variable (since constants are not to be substituted 
for it), even though it is a variable in the formula f(x) considered alone. 
A symbol that is a variable in part of its context considered alone but is 
not a variable in the whole context is said to be a dummy in the whole 
context. For example, x is a variable in x — x = 0, but it is not a vari- 
able in Vx x — x = 0, so we call it a dummy in the second context. We 
limit substitutions for dummies to variables not appearing elsewhere in 
the same context. 

From the preceding discussion it is evident that a quantified variable 
isa dummy. In (20). both x and y are dummies. Hence (20) actually in- 
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volves no variables, which fits with the fact that when we express (20) in 
words it 1s seen to be a statement and not a propositional formula. How- 
ever, “Vy x dislikes y” involves the variable x, and it is a propositional 
formula. 


(j) When we use the Rule of Hypothesis, any variable in the hypothesis 
becomes a dummy. Why? (k) Why in using the Rule of Hypothesis is it 
legitimate to substitute a variable not already appearing elsewhere in the 
context? 


PROBLEMS 


Letting S = x is the father of y, whére x and y are men, state 1 through 18 
In words and decide on truth values. 


1. Vx Vy S. 2. Vy Va S. 3. Vx dy 5. 

4, Vy iz S. 5. Jn Vy S. 6. Jy Vz S. 

7. dx dy 8S. 8. dy 32 5. 9. Va ily δ. 
10. Vy Ala 5. 11. diz Vy S. 12. aly V2 8S. 
13. Jr ity S. 14. dy 311 5. 15. diz dy S. 
- 16. Sly 35 δ. 17. 31. ily S. 18. ily dia S. 


19. Repeat 1 through 18 for S = the season x immediately follows the 
season y. 


State 20 through 26 symbolically. 


20. Every man has one and only one father. 

21. Some men are brothers. 

22. All men are brothers. 

23. There are two numbers whose product is 16. 
24. Every man has ancestors. 


25. Any even number greater than 4 is the sum of 2 odd primes (Goldbach’s 
conjecture). 
26. Ideas have consequences. 


27. Negate each of Problems 1 through 18. 
28. Do the same for Problem 19. 

29. ProveliltVyr+y =ytoue= ψ. 

80. Prove 3’ υψ αν = ya = y. 

31. Prove Va dlyz+y=yta2 = 0. 

32. Prove Vaz τέ Ὁ > Jlyzy = 1. 

33. Is d!x ly f(x, y) = Aly 315 f(z, y) a law? 


ANSWERS TO EXERCISES 


(a) F, F,T,T, F, F, T, T. (b) F, F, T,F,T,T,T, T. (c) Apply (2-10-2) 
and (2-10-3). (4) Apply (13) through (16); for example, [~ ayVax-+y = 
22] = [Vyixx-+y # 21] = [For every y there exists an x such that x + y 
does not equal 22]. (6) The right member of (17) reads “There exists an x 


128 ELEMENTARY LOGIC [cHAP. 2 


such that f(x), and if both x and y are solutions of f(x) then x and y are identi- 
cal.” The right member of (17) — its first term, by (2-7-6); hence (18) holds. 
To get (19) we negate both members of (17). The right member of the result, 
by De Morgan’s law, becomes ~ Iz f(z) V ~ V2,y [f(@) A fy) τοῦ = y|. The 
first term in this conjunction is Vz ~ f(z) by (2-10-3). The second simplifies 
as follows: Let p = f(z) A f(y), and let gq = (x = y). Then we have 
[~ V2 Vy p > 4] = [Ax ay ~ (p > @] = [x 3yp A “ταὶ by using (16) and 
(2-5-24). | 
(f) For two significant values, [3!x f(z)] = [f(a) V f(6)}. (6) Along lines 
indicated in Exercise (e). (8) Vyyt0= y. Hence Vyivy+a2 = y. 
Y¥te=yAyte? =yoyte=ytr)]—l[e =z]. (i) No. () We 
excluded substitutions in a hypothesis introduced by this rule. (k) It makes 
no difference what symbols we use so long as they are not otherwise involved. 


ANSWERS TO PROBLEMS 


1. Every man is the father of all men; 0. 2. Same as Problem 1. 3. Every 
man is the father of some man; 0. 4. Every man is a son of some man; 1. (This 
truth value and others below may be considered controversial by some, but this 
book is not concerned with theology.) 5. Some man is the father of all men; 0. 
6. Some man is a son of all men; 0. 7. Some man is the father of some man; 1. 
8, Same as Problem 7. 9. Every man has just one son; 0. 10. Every man has 
just one father; 1. 11. Just one man is the father of all men; 0. 12. Just one 
man is a son of all men; 0. 13. Some man is the father of just one son; 1. 14. 
There is a man who is a son of just one father; 1. 15. Just one man is the father 
of a son; 0. 16. Just one man is a son of some man; 0. 17. Just one man is 
the father of just one son; 0. 18. Same as 17. 

20. Vx [x is aman — 3!yy is the father of zx]. 21. 3x 3y 2 and y are brothers. 
22. Vz Vy x and y are brothers. 23. Jr 3yzy = 16. 24. νὰ [ὦ is a man —> (Ay y 
is an ancestor of x)]. 25. Vz[z is even A z > 0 > 3y32@ =yt2zA y and 
z are odd primes]. 26. Vx [x is an idea — Jy y is a consequence of x). 


2-12 Heuristic. Logic is concerned with deductions that are certain. 
As explained in Section 2-6, in a deductive theory all the theorems follow 
with certainty if the axioms, definitions, and rules of proof are accepted. 
Although strict logical thinking is very important in science and in daily 
living, it is by no means the only kind of worth-while intellectual activity. 
Indeed, without observation, experiment, discovery, and inspiration there 
would be nothing to be logical about! 

Before a theorem can be proved, it must be conceived and considered 
plausible. Before axioms can be used, they must be formulated. Before a 
theory can be built about certain concepts, basic terms must be created 
and defined. Before a logical theory can be constructed, its subject matter 
must be understood to some extent. Before an answer can be justified, it 
must be found. Such activities are exploratory and uncertain, as are all 
activities of creation and discovery. 
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The word “heuristic” is used as an adjective to describe activities 
serving to discover or reveal, including arguments that are persuasive and 
plausible without being logically rigorous. It is used as a noun to refer to 
the science and art of heuristic activity. 

Heuristic 15 obviously very important, and the high rewards that some- 
times go to its most successful devotees are well deserved. Why, then, is 
not more attention devoted to studying and perfecting this art? The 
most obvious answer is that, by definition, it includes just those things 
that have not yet been systematized and made scientific. As soon as we 
discover a formula for solving a particular kind of problem, we no longer 
need heuristic for dealing with it. It is possible, also, that most heuristic 
experts prefer to practice the art rather than to talk about it! 


(a) Look up “heuristic” in an unabridged dictionary. (b) Could heuristic 
be entirely formalized? 


We do not attempt here to discuss heuristic in any detail. Instead we 
make some brief observations and refer the reader to the very few books 
on the subject. Among these, the best one for a beginning is How To 
Solve It, by G. Polya (Princeton University Press, 1945, and also in a 
paperback edition issued by Doubleday, 1957.) In it we read: “The first 
rule of discovery is to have brains and luck. The second rule of discovery 
is to sit tight and wait till you get a bright idea.” A third rule might be: 
while waiting for brains, luck, and a bright idea, spend some time ex- 
perimenting, some time thinking, and some time resting. If there is any 
rule that applies to all heuristic situations, it is: observe, experiment, 
think, and rest. The greatest difficulty comes in combining thought and 
action. Some persons act without thinking. Others are afraid to act until 
they know the answer. In Shakespeare’s classic lines, 


Our doubts are traitors, 
And make us lose the good we oft might win 
By fearing to attempt. (Measure for Measure) 


Polya gives a very useful outline of steps that may be helpful in problem 
solving. The main steps are: (1) understand the problem, (2) devise a 
plan, (3) carry out the plan, (4) check and reconsider the results. Al- 
though the bare outline may not seem to be very helpful, Polya gives 
many practical suggestions under each step. The methods are applicable 
to all kinds of heuristic activity, both to problems where a simple answer 
is sought and problems consisting of discovering a proof. Perhaps a fifth 
step might be added: (5) clean up the problem. This suggestion would 
involve reformulating the problem and its solution in a logical and clear 
way after eliminating all the experiments and confusion of the heuristic 
process. Step (5) has usually been performed on the mathematics that 
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students see in books, which is the reason there is often no clue to how the 
results were discovered. 

It is possible to formulate heuristic rules that serve as guides in creative 
work. They differ from rules of proof or theorems of mathematics since 
they suggest possible lines of activity and do not claim to be laws. For 
example, 

Heuristic Rule: An identity may sometimes be proved by writing an 
identity whose left member is the left member of the identity to be proved 
and then modifying the right member by replacements until it 1s the same 
as the right member of the identity to be proved. 


(c) Give an example of our use of the procedure described in the preceding 
rule. (d) Suggest an obvious modification of the rule. (6) Formulate a heu- 
ristic rule to cover the cases in which we work with both sides of an identity. 
(ὃ Formulate other heuristic rules for doing proofs. Keep a list and add to 
it. (g) Read the following pages in How to Solve It: v-xv, 33-36, 102-103, 
118-122, 136, 194-197. 


Among the very few books on heuristic, the student will find the fol- 
lowing most helpful: 

How to Study—How to Solve, by H. M. Dadourian, Addison-Wesley, 
1951. Emphasizes problems in elementary mathematics. 

The Psychology of Invention in the Mathematical Field, by J. Hadamard, 
Princeton University Press, 1949. A fascinating little book by a great 
mathematician. 

Also interesting are: 

Mathematics and Plausible Reasoning (two ἘΣ by G. Polya, 
Princeton University Press, 1954. 

The Creative Process, by B. Ghiselin, University of California Press, 
1952. 

Problem-Solving Processes of College Students, by B. 8. Bloom and L. J. 
Broder, University of Chicago Press, 1950. 

Productive Thinking, by M. Wertheimer, Harper & Brothers, 1945. 


PROBLEMS 


Problems 1 through 12 are to be solved in the light of Polya’s discussion 
of heuristic and his outline. The answer should include an analysis of how 
the problem was solved as well as a justification of the solution. Of course, all 
problems in this book are exercises in heuristic. 


1. How much must a merchant mark up an article costing him $12 in order 
that he may later mark it down 25% from its sale price and still make a profit 
of 20% on it? 

2. A man agrees to build a garage to be owned jointly by himself and his 
neighbor, each to share equally in the cost. The man does all the labor himself, 
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working 200 hours, and the neighbor contributes $500 worth of material. If 
the man’s labor is worth $1.50 an hour, how should settlement be made? 

3. According to a recent news item, a mail-order house installed a $500,000 
electronic computer with which ten girls could do the work formerly done by 
150. Is this a money-saving move? 

4. Which would you rather have, four annual salary increases of $100 each 
or two salary increases of $200 each at two-year intervals? 

5. Did Greek authors of the pre-Christian era use footnotes? 

6. A lady asks how long she should make a valance to go all around a cir- 
cular table of diameter 3 feet 63 inches. What do you tell her? 

7. Fill in the blank in the following quotation from the Wall Street Journal. 
“Alden’s, Inc., voted a quarterly of 30 cents on the increased common, payable 
July 1, to stock of record June 14. A 50% common stock dividend was paid 
May 7. Before that 37 1/2 cents was paid quarterly on the old common. In 
announcing the increase, Robert W. Jackson, president, said the ‘previous com- 
mon quarterly dividend of 37 1/2 cents is equivalent to 25 cents based on out- 
standing shares reflecting a 50% distribution made May 7, 1954. The current 
rate of 30 cents, therefore, represents a % increase in the common stock 
dividend.’” 

8. A reed growing in the middle of a pond 8 feet in diameter just touches 
the edge when it is pulled over. How deep is the pool if one foot of the reed 
is above the water? (An old Chinese problem.) 

9. Find a formula for the long diagonal of a rectangular box of sides a, ὃ, and 6. 

10. Show that in chess it is impossible to reach a position in which all pawns 
of one color are in the same file. : 
%*11. In chess, a “won position” for White (Black) is one from which White 
(Black) can force a win no matter how Black (White) plays. A “draw position” 
is one in which either player can force at least a draw. It has been proved 
that every position in chess is a won position for Black, a won position for 
White, or else a draw. There is no known method of deciding which of these 
possibilities is the actual one in any case, however. If we modify the rules 
to permit a player not to make a move when it is his turn, it follows that the 
initial position is either a draw or a won position for White. Prove this last 
statement. 

12. A chess board can easily be completely covered by dominoes of a size 
that cover two adjacent squares. Show: that this cannot be done if two di- 
agonally opposite corner squares are removed from the board. 


Let δ, C, T, s, c, t be undefined terms satisfying the following axioms and 
subject to the laws of algebra of real numbers. 


(1) S?-+ C? = 1, 

(2) T= S/¢, 
(3) t = (75, 
(4) s = L/S, 
(5) c= L/C. 
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Prove 13 through 20. 


15: 2-1. = ο΄ 14.1+ ἐξ = s8?. 

15. CT = S. 16. Ss = Cc = Tt = 1. 
17. a — S\(e+ Τῇ = C. 18. e/(T+ =S. 

19. T2 = 5275 - §?, 20. (ce? + 87)/(sc) = T+ ἐ. 


21. An auditorium with 800 seats was not full one evening. Exactly $340 
was taken in. Adults paid 50¢, children 20¢. Prove that at least three times as 
many adults as children were present. 


Tf a sentence is true in a particular case we say that it is vertfied in that case. 
Verification is important. In empirical sciences it is the way in which scientific 
hypotheses are checked, but it should not be confused with proof. The physicist 
checks his hypotheses by attempting to verify them experimentally. If he finds 
a counterexample, he has to reject an alleged law. If he does not, he accepts 
the law tentatively. Usually he tries to prove the alleged law by mathematical 
methods from other laws thought to be correct. In short, he tries to prove his 
hypotheses by constructing a deductive theory, but this process is quite differ- 
ent from the process of checking by experiment and observation. 


22. Write a sentence that is true for 100,000,000 values of its variable but is 
not a law. 

23. Write one that is true for an infinite number of values of its variable, 
and also false for an infinite number! 

24. Cite an example of an. accepted scientific law and explain why it could 
not be verified for every case. 

25. Could verification by numerous cases ever prove a law? 


According to Webster, evidence is “that which furnishes, or tends to furnish 
proof,...” If p - 4, we call p conclusive evidence of g, but evidence for q in- 
cludes also any proposition that verifies, makes plausible, or otherwise supports 
belief in g. For example, F. H. Elwell in Elementary Accounting (p. 83) writes: 
“The fact that a trial balance ‘proves’ does not necessarily mean that the ledger 
accounts accurately record all transactions of the period. . . . In general, how- 
ever, the fact that the bookkeeper obtains a trial balance is usually accepted 
as evidence that the ledger accounts are correct.” 


26. Letting p = ledger accounts are correct, and g = books balance, which 
is true, p > gorg — p? 
27. In what sense is g evidence for p? 


Since evidence includes so much, it is impossible to indicate all types. Cer- 
tainly, however, when p — 4, q@ is used as evidence for p, and this is justified 
on the ground that the truth of ᾳ at least establishes conclusively that p may 
be true. 


28. Justify the last remark by showing the implications of q being false in 
such a situation. 
29. If p — q, is ~g conclusive evidence for ~p? 
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In legal practice evidence includes only such matter as conforms to the 
established rules of evidence. Direct evidence is evidence that “immediately 
establishes the fact to be proved by it,” while indirect evidence is that which 
“establishes immediately collateral facts from which the main fact may be 
inferred.” 


30. If q is the fact to be proved, p expresses the direct evidence, and r the 
indirect evidence, indicate symbolically the relations between p, q, and r. 

31. X is accused of stabbing Y. A testifies that he saw X stabbing Y. Direct 
or indirect? Conclusive? 

32. Z testifies that he saw X running from the scene with a bloody knife, 
that Y gasped out X’s name, and that Z saw no one else in the area. Direct 
or indirect? Convincing? Conclusive? 


The last question indicates the frequency, and indeed the necessity, of mak- 
ing decisions on the basis of evidence that is, strictly speaking, not conclusive. 
The law requires merely certainty “beyond a reasonable doubt,” which is much 
less certain than strict deductive proof. For example, in Problem 32 there may 
be doubts about Z’s honesty, his memory, his accuracy of observation, the 
meaning of Y’s last words, and the possibility of someone else having been 
present and committed the crime without Z’s knowing it. Nevertheless, the 
evidence is convincing because we feel it very unlikely that these things be 
accompanied by Z’s testimony. We ignore here the aspects of evidence that 
involve probability judgments, and consider merely examples in which (the 
evidence) — (the fact) and those in which it does not. Evidence of the latter 
type is usually called circumstantial evidence. 


33. Would you call Z’s testimony circumstantial? 


Let a = (X committed the crime), and ὃ = (X was present at the time of 
the crime). An alibi is a claim on the part of X to have been elsewhere, i.e., a 
claim that “Ὁ. 


34, Which is correct, a — 6 or b > a? 
35. Why is a valid alibi conclusive proof of innocence? 


Let G = the defendant is guilty, 1 = ~G, C = evidence for G such that 
G — C, but ~(C — G), and Εἰ = the evidence such that E —- G. 


36. Write various relations between G, J, and E. 

37. What is the basis in logic for the requirement that the prosecution must 
prove G (“assumed innocent until proven guilty”) rather than the alternative 
of requiring the defense to prove 1 (“guilt by accusation”)? In this connection 
consider the possibility of both truth and falsity for Εἰ and C, and also the 
possibility that all evidence is inconclusive. 


CHAPTER 3 
ELEMENTARY THEORY OF SETS 


3-1 The set concept. When we think about several things, we may 
consider them individually or we may view them as a single whole, class, 
collection, or aggregate. Mathematicians call such a whole a set, and they 
refer to the individual objects of which it is composed as members that 
belong to the set. 

Words that designate sets are called collective nouns by grammarians. 
In ordinary discourse, collective nouns are used sometimes to refer to a 
set and sometimes to its members. For example, we may say “That 
bunch of grapes is heavy” or “That bunch of grapes have thick skins.” 
The first sentence refers to the set, the second to the individual members. 
In mathematics, the name of a set refers only to the set as a whole. 

An example of a set is the United States Senate. Its members are 
senators. Other examples of sets are the Los Angeles Dodgers, the planets, 
the living ex-Presidents, the even numbers, the possible opening moves 
in a game of chess, and the possible outcomes of an experiment. 


(a) Describe the members of each of the sets named in the last sentence. 
(b) Among words used as variables to stand for sets are “collection,” “flock,” 
“herd,” “bunch,” and “crowd.” Indicate the kinds of sets described by these 
words, and give other examples of collective common nouns. 


The set concept plays a very important role in mathematics, and indeed 
in all clear thinking. It is one of the basic concepts in terms of which other 
mathematical ideas are explained. The purposes of this chapter are (1) to 
familiarize the student with the concepts and notations for dealing with 
sets, (2) to show the relations between sets and sentences, and (3) to apply 
set theory to further development of elementary algebra and to linear 
analytic geometry. 


ANSWERS TO EXERCISES 


(a) Baseball players; planets; Hoover, Truman; ...—6, —4, —2, 0, 2, 4, 
6,... 3 any one of 20 different moves; say, different readings on a dial. 


3-2 Designating sets. Since we frequently wish to say that an object 
is a member of some set, we adopt the special notation a Ε C to mean that 
a belongs to C. This is an informal definition. The symbol a €C will be 
introduced as a basic term of the theory. We write a¢C to mean that a 
is not a member of C. 
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(a) Define a Z C formally in terms of a € C and logical symbolism. 


Suppose we wish to designate the set whose members are just the 
numbers 1, 2, and 3. We wish to distinguish between the set considered 
as a whole and the individual members. Hence it would be unsatisfactory 
to write “the set 1, 2, and 3.” We could, of course, always write “the set 
whose members are 1, 2, 3,” but it is convenient to have an abbreviation. 
We adopt the device of writing a list of the members within braces. For 
example, {1, 2,3} = the set whose only members are 1, 2, and 3. Note 
that the braces are read “the set whose members are... .” 


Describe in words: (b) {1,2}, (ce) {2}, (Ἃα) {a,b,c}, (e) {0,1}, 
(f) {0}. 


We call the above way of designating sets the roster notation. We refer 
to the list of members that appear within the braces as the roster of the set. 
Note that the roster is an expression made up of names of the members; 
it is not a name of the set. | 


Use the roster notation to designate the following sets. (g) The positive 
integers less than 5. (ἢ) The solutions of 2x = 5. (i) The three major 
religious faiths in the U.S. (ἡ) The digits greater than 2 and less than 7. 


The roster notation is very convenient for sets with only a few members. 
However, there are sets with many members (e.g., the population of the 
world) and even with an infinite number of members (e.g., the points on a 
line). For such sets the roster notation is awkward or impossible. To think 
of some things as a set we do not need to know or list every member. All 
we need to know is some sentence that tells us which things are members 
and which are not. That is, we need to know of some sentence like “A 
thing is a member of the set if and only if the thing... ,” for example, 
“A person is a citizen of the U.S. if and only if the person is born in the 
U.S. or is naturalized.” This sentence completely determines the set 
called “the citizens of the U.S.” even though we would hardly care to list 
all the members. Replacing “a thing” by “zx,” a rule for designating a set 
takes the form “zx is a member of the set if and only if x... ,” where the 
dots indicate the rest of some sentence of which z is the subject. In our 
example, we have “sx is a citizen if and only if x is born in the U.S. or x 
is naturalized.” Letting f(x) = (ὦ is born in the US. or is naturalized), 
we may say that (the citizens of the U.S.) = (the set of x’s such that f(z)). 


Complete the following: (k) The student body of this college = the set 
of z’s such that x ___.._.__— (1) The living ex-Presidents = the set of 2x’s 
such that « ____.___ and 2 


We now adopt an abbreviation for the formula “the set of x’s such that 
f(x)”, namely, {2|f(x)}. In this notation, the braces are read “the set” and 
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the vertical bar is read “such that.” For example, {x|2x = 1} = { 
since the only x such that 2x = 1is 4%. (It ould be noted that 13} x 3; 
the first being the set whose one ἐπε τες is 4, the second being 4 itself.) 
When a set is designated in this way, we call the sentence that appears 
after the vertical bar a defining sentence of the set. We say a defining sen- 
tence because the same set may have many defining sentences. Thus 
[21 ἘΞΕ:: 1] Pape a4 a ἰσ θα —L=a2}=.... 

re a set by a defining sentence is an entirely general method 
that applies to any set. If we know the roster of the set we can always use 
the designation “the set of all x such that z is one of the items listed on 
the roster.” Thus {1, 2, 3} ees 1 ξεν So oT ave 
do not know the roster, we must iow some characteristic that determines 
the members of the set (unless we do not know what set we are talking 
about), and we need merely state a sentence embodying the requirement 
that members possess this characteristic. 


Designate the following by words or by the roster notation: 
(m) {aja+4= 5}. (n) {2]a =3 Vx = 4}. (0) {z]z has no living 
parents, x is a child, and z is living}. (p) [21 has been convicted of a crime}. 

The set of solutions of a sentence f(x), that is, the set {x|f(x)}, is called 
the truth set of f(x). For example, the truth set of 77 = 4 is {2, —2}. 

The set of all values of a term is called the range of the term. For ex- 
ample, the range of x in 1/z is all numbers other than zero. If the range of 
x is the real numbers, the range of x? is the non-negative real numbers. 
As indicated in Section 1-3, the range of a variable or other term may be 
specified explicitly or indicated by the context. The range of a variable 
in a formula containing just one variable is called the domain of the for- 
mula. For example, the set of values of x in “the father of x” 1s called the 
range of x and the domain of “the father of x.” 


(q) If the domain of a sentence is the same as the truth set of the sentence, 
what can we conclude? (r) What is the range of x in 1/(l1 — 2)? (s) in 
MEX 2 


If we look at the equation {1,2} = {a|x = 1 V x = 2}, we note 
that there are no variables on the left. Indeed, the left member is a name 
of a quite definite set and is therefore a constant. Hence the right member 
must also be a constant. How do we explain the presence of what is ap- 
parently the variable “x”? Certainly “x” is a variable in the sentence x = 
1 V x = 2, but in the context of the entire right member it is not a 
variable. When we say “the 2’s such that x = 1 V x = 2” we mean 
“the solutions of the sentence x = 1 V x = 2.” ‘To substitute a con- 
stant for “x” would make no sense. Accordingly, “x” is a dummy as de- 
fined in Section 2-11. We may substitute for it any variable not already 
appearing in the context without changing the meaning of the expression. 
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Thus {z|2x = 1} = {y|2y = 1}. But we may not substitute for it a 
constant or some variable already appearing, for example, {x|2x = a} = 
ἰα 2) = {y|2y = a} τέ {a\2a = a} = {0}. 


Show that the following are laws. (0) {zjax-+6 = 0} = {—b/a}. 
(u) {x|(@ — αγώ — δ) = 0} = {a,b}. (v) {2|x? = a?} = fa, —a}. 
(w) (α] αὐ = a?} = allnumbers. (x) {ala? = x2} = {z, —a}. 


We take “a Ε S” and “{x|f(xz)}” as basic undefined formulas. An ex- 
pression of the form “a Ε S” is a sentence if the substitute for “S” is a 
term whose values are sets, provided that certain limitations are placed 
on the substitutes for “a.” It is necessary to restrict the substitutes for 
“a” to avoid nonsense and paradoxes, but the subject is too difficult to 
present here.* 

Common sense will suffice for our purposes. In “{z|f(z)}” any sentence 
is a significant substitute for “f(r),” and x isa dummy, as indicated above. 


Complete the following: (y) The values of aG S are ____ sf S ig a 
(z) The values of {z|f(z)} are ____ if f(z) is a 
PROBLEMS 


Identify the sets in Problems 1 through 9. 


1. (|ἰσχξ2νχ ε-ϑβν χ ὴῖιεάν χ:δν χεθν τ: --ῦν 5 - 
δϑνχξθν χξον .χ.πΞ|1). 

2. {ἢ} 2? ΞΞ ἢ). 

8. {2132 -Ἐ 22 — 1 = 0}. : 

4.J = {xjx =O0V (yyeJtA (4 =y V x = —y))}, where J+ = the 
positive integers. 7 | 

5. FE = {xltEJ A lyyEJ/ A & = 2y}, where J = the integers. 


6. ὃ = {z|/tEJ A cE FE}. 
7 {2elxEJ A JyyEJ Ax = 2y4+ 1}. 

8. Ra = {z|imnmeEJ A nEJ A x = m/n}. | 

9.P = {e\tETAr>1A vyz(yEedt A zEJt+ At = yz) Oo 


1V z= 1ὴ]}. 
“2 — 4 
10. Does (2[1 - 2 = 4} = te = 4: 


o~ 
<= 
I 


*11. It is said that a set can be defined either connotatively or denotatively. 
Which of these terms applies to the definition by roster and which to definition 
by a defining sentence? 
12. Which is correct: “2 € {1, 3}” is false, or “2 € {1, 3}” is not a sentence? 
13. Argue that the domain of f(x) is {x|f(z) = f(x)} and the range of f(x) 
is {y|vy = f(2)}. | 


* For example, if we admit “S Ε S” to be a sentence, we can prove any sen- 
tence, e.g., “(p A ~p),” to be a law. See “On the Theory of Types,” by W. V. 
Quine in The Journal of Symbolic Logic, Vol. 3, No. 4, December, 1938. 
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In Problems 14 through 20 use the defining-sentence notation to name the 
indicated set. | 


14. The set whose numbers are 1 and 2. 

15. The positive integers between ᾧ and ὃ. 

16. The positive integers that are greater than 1 and also less than 2. 
17. The States of the U.S. touching an ocean. 

18. The integers that are multiples of 3. 

19. The integers that leave remainders of 1 when divided by 7. 

20. Integers that are perfect squares. 


ANSWERS TO EXERCISES 


(a) ac = ~(a€c). (b) The set whose members are just 1 and 2. 
(ὁ) The set whose one member is 2. (4) The set whose members are just a, ὃ, 
andc. (e) The set whose members are just 1 and 0. (ἢ) The set whose one 
member is 0. (6) {1,2,3,4}. (Ὁ) {2.5}; note that we speak of “the solu- 
tions” even though the set turns out to have only one member. (i) {Catholi- 
cism, Protestantism, Judaism}. (1) {3, 4, 5, 6}. (k) is registered here. 
(1) is living; was but is no longer President. (m) {1}. (nm) {8,4}. (0) All 
orphans. (p) All felons. (q) The sentence is a law. (r) All numbers 
except 1. (8) Set of all men. (ὦ) —b/a is unique solution. (u) through 
(x) See Section 2-8. (y) sentences; set. (z) sets; sentence. 


ANSWERS TO PROBLEMS 


1. The digits. 8. {--} - ΨΩ, —1 —+/2}. 5. The even integers. 7. The 
odd integers. 9. The prime numbers. 13. Since f(x) = f(x) for just the values 
of x that have meaning in the context. 15. 1. 17. {z|zis a State A x touches 
an ocean}. 19. {zjzEJ A 3yyEJd Az = Ty + 1}. 


3-3 Special sets. In any discussion, we talk about some set of objects. 
The set of all things under discussion in any context is called the unzverse 
of discourse or universal set, and we shall designate it by U. For example, 
in elementary algebra the universe of discourse is usually the real numbers. 
From the members of U we can construct sets, which we call subsets of U. 
Then a ES if S is a subset of U and a is a member of S. Also {x|f(x)} is 
a, subset of U if the values of x in f(x) are members of U, that is, if the 
domain of f(z) is a subset of U. 

Suppose, now, that the domain of f(x) is a subset of U but that f(x) 
is false for all values of x. We still consider {2|f(x)} to be a set and a sub- 
set of U even though it has no members. We call such a set empty or 
vacuous. We define @ as follows. 


(1) Def. ᾿" Ὁ = (χ2[2 Κέ x}. 
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Clearly @ is vacuous; that is, we have the law 


(2) “(ὦ € 9). 
(a) Why? 


We call @ the null set. The word “the” is used because any vacuous set 
is equal to (ὁ. [See (8) below.] 

To prove (2) or any other theorem about sets, we need some axioms con- 
taining our undefined terms. The first axiom below embodies the essential 
idea in our notion of {x|f(x)}, namely that it is just the set of objects 
satisfying f(x). 


(3) Ax. la ΕΞ {x|f(x)}] = fla). 
(b) In (3) what are the variables, and what terms, if any, are dummies? 


Our next axiom indicates that we regard a set as just the collection of 
its members. To say that sets A and B are identical is to say that they 
have the same members. 


(4) Ax, (A = B) =Vr(xEAoxveB),. 


Now we can prove 
(5) A= {t|zeE A}, 
(6) [{x|f(z)} = {τ |σ(ῳ}}] = Vr f(x) © 96). 


This theorem tells us that two logically equivalent sentences define the 
same set, and conversely. Evidently every set A has many defining sen- 
tences, among which, by (5), Σ € A is one. 

The following axiom indicates the nature of the universal set. 


(7) Ax. C=. 14. S20 


(c) Could the second = in (6) be changed to <<»? (d) Give an example 
of three sentences defining the same set. (6) Give a sentence that defines 
the set of all statements. 


We can now prove that any vacuous set is identical with the null set ὦ; 
that is, 


(8) Vax€é ΑἹ oo [A = Ὁ]. 


Indeed, by (6), A = {x|xreE A} = {1|χ ~ x} — @ if and only if 
Valx # αὶ <> x EA]. But by hypothesis, the right member of this equiva- 
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lence is always false, and we know that ὦ γέ ὦ is always false. Hence 
« ~£ xand2e€A are logically equivalent for all 5, as required. 


(f) Show that 3!4 γα σε A. (g) We have seen that the null set is unique. 
Is this true of the universe of discourse U; that is, does (7) determine a unique 
U?  (h) What is {z|1/a = 0}? 


Now we introduce the roster notation by definition. 


(9) Def. fa} = [22 = a}, 
(10) Ax. a # {a}, 

(11) a © {a}, 

(12) dlzx € {a}, 

(13) ({a} = {b}) «» (a = ὃ). 


We call a set with just one number a singleton. Note how the “oneness” 
of a singleton is expressed by (12) without using numbers. 


(i) Why did we include (10) as a law? (j) What is the variable in (12) and 
what is the dummy? (k) Write each of (9) through (13) with quantifiers 
as you would if we did not have the convention that we omit quantifiers that 
stand before a law. 


(14) Def. fa,b} = {ala => a V zx = DB}, 
(15) Def. fa, b,c} = {x|xe {a,b} V x = ¢}. 


When a ~ b, we call {a, δ} a pair. Similarly, when a # ὃ, ὃ ᾽ξ c, and | 
a τέ c, we call {a, b,c} a triple. When designating sets by their rosters, 
it is customary to name each member only once in the roster. Hence, 
ordinarily {a, δ} is a pair and {a, b, c} is a triple. 


(1) Define {a, δ, c, d} and {a, ὃ, ε, d, e}. (m) Distinguish between a, {a}, 
“a,” {“α᾽}, and “fay.” 


(16) {a, a} = {a}, 
(17) {a,b,c} = {zj2=aVe=bV r= οἷ, 
(18) {a,b} = {b, a}, 


(19) [ία, ὃ) = {a 4}] ola@a=cAb=d)V @=dAb=d)], 
(20) [{fa} = {b,c}]] «Ὁ (α ΞΞ ὁ -- ὦ). 
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x Special legislation. Constitutions often prohibit special legislation, 
that is, laws that apply to a single community, individual, or group. One 
way of getting around this is to pass a law applying to a certain set of com- 
munities, but to word the defining sentence of the set so that it actually 
defines a singleton! Professor Ralph Fjelstad, after a study of the Minne- 
sota legislature, reported that in the 1953 session there were hundreds of 
“special bills couched in general terms.” For example, “Any village hav- 
ing a population of more than 1,300 but less than 1,500 persons according 
to the last applicable state or federal] census and having an assessed valua- 
tion of real and personal property in excess of $1,000,000 may....” It 
“so happened” that only one village, Aurora, met these qualifications. 
The formula for a law of this kind is “Any x such that f(z) may... ,” 
where one takes care that 3}. f(x) and f(a), where a is the individual com- 
munity, person, or group for which one wishes to pass special legislation 
“couched in general terms.” 


(n) Express the defining sentence in the previous example, using 2 as the 
variable. (0) Using the expression in Exercise (n), write symbolically that 
there is only one such village. (p) Use the symbols of this section to express 
the fact that this one solution is the village of Aurora. (q) Suggest other 
kinds of sentences that might define a singleton whose one member is a town. 
(r) Give, if you can, examples of similar methods of evading prohibition of 
legislation directed against special religious, racial, national, or political groups. 


PROBLEMS 


In Problems 1 through 6 identify the sets. 


1. {xlx” = 16}. 2. {121 +1 = 0). 

3. {117}. = 4}. 4, {x|(2 — 2)(2 + 1)(x + 1000) = 0}. 

δ. {2|1/x = 1/z}- 6. {alr = V9}. 

7. Does ὁ = 0? 

8. Show that {x{z > x} = {z|z-0 = 2}. 

9. Prove (2). 

10. Prove (5) and (6). 

11. Prove some of (9) through (13) and (16) through (20). 

12. Show that [A ¥ B] = 3x (@E AA xcEB)V @EBA χᾷ A). 

13. The following laws indicate the relation between quantifiers and set 
membership. Justify them. 


Ι 


(21) [va f(x)] = [{27.2}} = (172) = 7}}}, 


I 


(22) [ax f(x)] = [{x|f(x)} γέ QO). 
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14. Prove the following: 
(23) Iz = y] OVS (2 ESO yES), 
(24) [x = ψ] > [{S|zES8} = {Sly Ε 5}}. 


15. Verbalize (23) and (24). 
16. Prove (25). | 


(25) | ταῦ. 
17. Define A ~ B (for sets) without using = in the defining term. 


Axiom (10) formalizes our distinction between a set and its members. Failure 
to make this distinction results in confusion. The fallacy of taking a statement 
about the members of a set to be about the whole set is called the fallacy 
of composition. The opposite fallacy of mistaking a statement about a set to be 
a statement about its members is called the fallacy of division. 


* 18. Let A = the barrel of apples is red. Does A = the barrel € red things? 
*19. Let B = the barrel of apples is heavy. Does B = the barrel € heavy 
things ? 

% 20. What is the fallacy in the following? The even numbers, £, are divisible 
by 2; hence, EF is divisible by 2, and so Καὶ is an even number. 

*21. Explain the fallacy in the following: The Russians claim to have liqui- 
dated the kulaks, but this is obviously false since some of them escaped. 

% 22. What two sets of Americans were eliminated by the Civil War? Were 
their members eliminated? 

% 23. A corporation may be sued for debt. Mr. Jones is a member of the 
corporation, hence he may be sued. Valid? 

%* 24. In the eyes of the law a corporation is a person. A person may be sued, 
hence a corporation may be sued. Valid? 


ANSWERS TO EXERCISES 


(a) Vaa = x, hence Vz ~ (a ¥ x) and every z fails to satisfy the defining 
sentence. (Ὁ) “a” and “f” are variables, “x” is a dummy. (ὁ) Yes, since 
p=QgQ->peqg. (ἀ) « -- 8 Ξ 06, (ὦ -- 3)2 =0, w@=6 (e) rod. 
(f) Recall (2-11-17). JA Vax Α by (2). Suppose γχ χα A and Vaz € B. 
Then by (8) A = Ὁ and B = @. Hence the second term in the right mem- 
ber of (2-11-17) with (a:A, y:B) is satisfied. (g) No. U depends on the 
context. (8) Nullset. (i) To besure that “a € {a}” isasentence. (j) “a” 
is a variable, “x” adummy. (k) For example, (9) becomes Va {a} = {z|xz = a}. 

(Ὁ {α, ὃ, c,d} = {z|z2 € {a,b,c} V x = αὶ. (m) The object a, the single- 
ton whose one member is a, the letter “a,” the singleton whose member is the 
first letter of the alphabet, and a name of the singleton whose one member is a. 
(n) x is a village having a population... and having .. . $1,000,000. 

(o) Slzawisa.... (p) {Aurora} = [21 is a village...}. (4) There are 
many, but the trick is to find one that does not appear to name a single village! 
(r) Literacy tests, country-of-origin tests, etc., in certain circumstances. 
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ANSWERS TO PROBLEMS 


1. {4,~—4}. 3. {1,1}. 5. All nonzero numbers. 7. No! 0 is a number. 
It is the number of members in Ὁ, but it is not @ itself! 14. To prove that the 
right member of (23) implies the left member, assume the right member by 
Hyp, use (2-10-4) with (y:{2}) to get cE {x} «Ὁ νῈ {x}. Since x Ε {zx}, 
y © {x}, from which y = x. 17. See Problem 12. 


3-4 Subsets. We call the set A a subset of the set B if every member 
of A is a member of B. We use the abbreviation 4 C B. An alternative 
verbalization is that “B includes A.” 


(1) Def. ACB=VrlxE Axe Bl. 
(a) Read definition (1). (Ὁ) Show that: 
(2) [xl f(a} S {x]g(x)}] = Ve [f(z) > g(a). 


We may illustrate conveniently by using small sets. Let U = {1, 2, 8, 
4,5},A = {1, 2, 3}, B= {2,3},C = {1,4,5}. Than ACU,BCU, 
BCA, but ~(A CB) and ~(A CC). 

An alternative illustration arises from letting U = the set of points in 
the interior of the square in Fig. 3-1 and letting A and B equal the sets of 
points within the labeled circles. In the figure, A C B, A C U,and BCU. 


Figure 3~1 


True or false? (Give reasons.) (c) {1} © {1,2}; (4) {4,5}.¢ {1, 2, 8, 4, 5}; 
(e) {2} © {3,4}; (ἢ {2} Ε {2,3}; () {2} « (2, 3}. 


It is important to distinguish carefully between membership and in- 
clusion; as suggested in Exercises (f) and (g), they are quite different. It 
is possible to construct examples in which a member of a set is also one of 
its subsets. Thus {2} Ε { {2}, 2,3} and {2} C {{2}, 2,3}. The first is true 
because “{2}” appears in the roster. The second is true because “2” ap- 
pears in the roster, and hence {2} is a set made up of members of the given 
set. Jt should be kept in mind that € is used to indicate a relation between 
a member and a set, whereas C indicates a relation between two sets. 
Of course, a set may be a member of another set, as indicated in the 
examiple. 
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Given any set, we may form various subsets. For example, {1, 2, 3} 
has the subsets ὦ, {1}, {2}, {3}, {1,2}, {1,3}, {2,3}, {1, 2,3}. The set 
of all subsets of this set is then {@, {1}, {2}, {3}, {1,2}, {1,3}, {2, 3}, 
{1, 2,3}}. Note that the null set and the set itself are members of this set. 
We call the set of all subsets of A the power set of A. Let A be a women’s 
club. Then the power set of A is the set of all possible committees, in- 
cluding the committee of the whole (the club itself) and the committee to 
which no one belongs. We designate the power set by @(A). 


Find the power set of: (8) {1,2}, () {2,3}, (4) {1}, (Ὁ Ὁ, 
( {1, 2, 8, 4}. | 


The examples given suggest that the number of members in the power 
set of A is 24), where 91(A) is the number of members in A. Such is 
indeed the case. This is the origin of the name “power set.” 

Linear graphs. Let U be the set of all points on an axis. Because of the 
one-to-one correspondence between the points of U and the real numbers, 
we may think of U also as the set of all real numbers. Let f(x) be a sen- 
tence whose domain (range of x) is a subset of U. Then the truth set of 
f(x) is a set of points on the axis, a subset of U. We call this set of points 
the linear graph of f(x). Thus the graph of x? = 1 is the set of points 
(real numbers) {1, —1}. The graph of a sentence is then just a geometrical 
picture of its truth set. Here the graph of x? = 1 is a geometrical picture 
of [132 = 1}. The graph may be a finite set or it may contain an in- 
finite number of points. For example, the graph of x < 2 contains all 
points to the left of 2, as suggested in Fig. 3-2. We often use the defining 
sentence as a name for its graph. 


᾽ « 2 
ΠῚ ee θαυ ΕΞ πεν Laeenreee ΩΣ 
-2 --ἰ 0 ] 2 3 


FiGurRE 3-2 


Sketch the linear graph of each of the following sentences: (m) x? = 9, 
(nyc >—l, (0) e<38Vae>4, (pP)r<4Ae>3, (4) (ὃ -- 1) 
(¢+5)>0, (ἡ 22<0A 23> 8. (8) Describe @(U) where U = the 


axis of real numbers. 


Intervals of real numbers. We say that x lies between a and ὃ, where 
a<b, ifa<xAzxz<b. It is natural to adopt the following 
abbreviation. | 


(3) Def. (ὰ «ἀ <b) =]@< 2h & « δ). 
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We call {z|a < x < δ) the open interval from a to ὃ and designate it 
by (α δ). The word “open” refers to the fact that it does not contain 
its endpoints. 


(4) Def. (a_b) = (τ]ὰ < x < 5}. 


It is convenient to think of the set of all points to the left of a given 
point (all numbers less than a given one) as an open interval stretching 
“to infinity” toward the left. We designate it by (— οὐ _6), but the student 
should keep in mind that oo is not a numeral. Similarly, (a__o) is the 
set of all points (numbers) to the right of a. 


(5) Def. (~—o ὃ) = {11 « ὃ), 
| (6) Def. (a@__00) = {2[1 > a}. 


Describe in set notation, in words, and sketch: ({) (24), (u) (—1_1), 
(v) (@— €_a+e). 


When the endpoints belong to an interval, it is called closed and is 
designated by (α δ). 


(7) Def. (α “ α “Φ) κε- (. “χα λ 1 <b), 
(8) Def. (a_6) = {zJa <x < 6). 


When only one of the endpoints belongs, the interval is said to be 
closed on the right or on the left. Thus (α ὃ) is closed on the left and open 
on the right, (a__b) is open on the left and closed on the right. An infinite 
interval cannot be closed, since it does not have a first or last point in the 
direction in which it “stretches to infinity.” 


Give definitions of: (w)a<2z<b, ()α :. « δ, (y)a>ec>b, 
(2) (a_), (aa) (@_6), (bb) (a_~), (ec) (--οο δ). Sketch and de- 
scribe in words: (dd) (3_ 4), (ee) (—1_1), (ff) (0 ο), (gg) (--ὦ 0), 
(hh) (=100 100), (ii) (α -- ε ate). | 


PROBLEMS 
1. Find Φ {0}. 2. Find @ {1, 2, 3, 4, 5}. 
Graph Problems 3 through 10. 
3. ἃ] < 1. 4, u*+ 27 — ἃ = 0, 
5. 27+ 471+ 4 = 0. 6. 3x — 1 = 0. 
7.3r—-1<0. 8. τᾷ (—1_ 1). 


9. 27 < 0. 10. rE (—w_ ow), 
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In Problems 11 through 19 write inequalities synonymous to each expression. 


1. χε ὦ 3). 12.“ ἘπῚ2. 9); 13. 2 € (2 8). 
14, ΧΕ (-ΞΞῚ o), 15. 2€ (-τοὸ 58). 16. ,Ε (ΞΞῚ 5). 
17, .Ε (2_ 8). 18. χᾷ (--ὦ -οσἹ}. 19. ζξ (-[(-1 οὐ). 
Tell whether each of Problems 20 through 28 is true or false. 

20. 2 8) € (ὦ 3). 21. (1. 30) SE (4 | 1). 

22. (3_ 5) € (4 δ). 23. (1__ 5) € (2 6). 


24. Identify (3__ 3). 
25. Comment on (38 2), (2 2), and (2 ---οο). 
26. [52ς (ὦ 3)? 

*% 27. Name several subsets of the U.S. Senate. 


ANSWERS TO EXERCISES 


(a) “A is included in B” means that ae member of A is also a member of B. 
(b) (1) and (8-3-3). (c) T. (d) T (e) F. (f) F. (g) T. 
(h) {@, {1}, {2}, {1,23}. @ (ὦ, {2}, {3}, {2,33}. 6) (ὁ, {1}}. 

(k) {92}. (1) (Ὁ, {1}, {2}, (8}, {4}, {Π, 2}, {1,3}, {1,4}, {2,3}, {2,4}, 
{3, 4}, {1, 2, 3}, {1, 2, 4}, { 2, 3, 4}, {1, 3, 4}, {1, 2, 3, 4}}. (m) Two points, 
3 and —3. (n) All points to right of ποῖ (ο) ΑἹ points to left of 3 and all 
points to the right of 4. (p) All points between 3 and 4. (q) All points to 
the left of —5 and all points to the right of 1. (r) ΝᾺ] βού. (8) The set of 
all linear graphs. (t) {z|2 < x < 4}; points between 2 and 4. (w) a < 
tAN2<b. (Ὁ) {tlja<ax< δ) (bb) {z/x > a}. (ii) The closed in- 
terval from a — etoa-+ 6. | 


ANSWERS TO PROBLEMS 


1). δ. {—2}. 7. (—0__ 1/3}. 9. @ 11. 2 « “ « 8. 
13.2<271<3. 15 χ «8. 17. 1 «2Ν 21» 8. 19... Ξ --ΙἸ. 21. Ε, 


3-5 Ordering of the real numbers. In Sections 1-7 and 8-4 we dealt 
with inequalities in terms of their geometric interpretation without 
definitions or proof. In this section we use simple set theoretic ideas to 
give a more logical theory. 

Let Re be the set of all real numbers. We assume that there is a subset 
of Re, which we call Ret, with the following properties. 


(1) Ax. (a € Ret A b © Ret) -- (a+ δ) & Ret, 

(2) Ax. (ae Ret A δε Ret) > ab ε Ret, 

(3) Ax. a © Re — [(a € Ret) V (a = 0) V (—ae Re*)], 
(4) Deh α «Ὁ -- (ὁ — α)ε Ret, 
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(5) Def. a>b=bd <a, 
(6) a>O=ae Re*. 
(7) α «0- —ae Ret. 


(a) Prove (7). 


From (6), which follows immediately from (4)(a Ὁ, b:a), we see that 
Re* is just the set of positive real numbers. We may read “a > 0” 
(a ΕΞ Re™) as “a is positive” and “a < 0” (—a © Ret) as “ais negative.” 
With this intuitive idea of Ret, (1) through (3) are most plausible. How- 
ever, the proofs that follow do not depend on any interpretation. 


(b) Justify (8) through (13). 


(8) (@>0Ab>0)>5 (a+b > 0), 
(9) (a>0Ab> 0) > (ab > 0), 
(10) (a <0) V @=0) V (a> O), 
(11) (a > 0) = (—a < 0), 

(12) (a « 0) = (—a > 0), 

(13) (a « ὃ) -- ὁ -- 4» 0). 


We can now prove the following frequently used laws. 


(14) [[(α « δ) A (b< od] -Ὁ (a « ὦ, 
(15) (a « b) (ae. < δ᾽: Ὁ), 
(16) ec>0->(a< be ae < bo), 
(16’) σ «0 -" (α < be ae > be). 


The proof of (14) rests directly on (4). By hypothesis we have a <b 
and ὃ < ὁ, that is, ὃ — a Ε Ret and c — b € Ret. From (1) we have, 
then, (6 — a) + (ec — ὃ) € Ret ore —ac Ret. But this last 15 Just 
our conclusion by (4). | | 


*(c) Write out the preceding proof, with steps and reasons, in formal style. 
(d) Prove (15). (Suggestion: Write hypothesis and conclusion in terms of Ret.) 


To prove (16) and (16’), we begin with a < b, that is, b αε Ret, 
by hypothesis. Now, if in addition c > 0, that is, c € Ret, we have 
(6 — ajc Εἰ Re*, be —ac € Ret, or ac < be, as desired in (16). But if 
c <0, —e > 0 or —c & θ΄ by (12), and we have (b — a)(—c) € Ret, 
ac — be Ε Re*, or ac > be, as required in (16’). 


148 ELEMENTARY THEORY OF SETS τ [onap. 3 


An understanding of (16’) is essential for work with inequalities. Be- 
cause of it, inequalities cannot be treated as though they were equations! 

(e) Illustrate (16) and (16’) for (α:2, d:4, 6:2), (a:2, 6:4, c:—2), (a:—2, 6:4, 
c:—1). (Ὁ Restate (8) through (16’) in geometric terms by reading a < ὃ as 
“a lies to the left of δ." 


Laws (15) through (16’) can be used to solve simple inequalities. For 
example, 


(17) 38a +4 < bat lo --2. < —4 (15) 
(18) oxr> 2 (16’). 


(g) Show that mz + ὃ < 0<> 2 < —b/mis not a law, and write a correct 
law giving the solution of mz+b <0. (h) Solve 1 — 5z > 22-+ 4 as in 
(17) and (18). (i) What is the geometric interpretation of a < δῇ of a > δ᾽ 


(19) Def. (a<b)=(a<bVa= ὃ), 
(20) Def. (a>b)=(a>O0V a= δ). 


We can now easily prove that the square of any real number is non- 
negative; that is, 


(21) te Rex’ > 0. 


(1) According to (10), there are only three possible cases: x < 0, x = Q, 
a>0 If «<0, —*>0, and (—2)(—z) > 0 by (12) and (8). But 
(—x)(—x) = x”, so x” > 0 in this case. Dispose of the other two cases. 


Suppose we wish to solve the inequality 7? — 4 < 0. We have (x + 2) 
(x —2) «0. If +2 > 0, we may divide by it to get x —2 < 0. 
Hence if both x + 2 > Oand x —2 < 0, that is, if > —2andz < 2, 
z is a solution. Evidently any point in the open interval (—2__2) 1s a 
solution. But suppose x +2 <0. Then division yields x — 2 > 0. 
Hence if both x + 2 < 0 and x — 2 > 0 are satisfied, x is a solution. 
But this means x < —2 A x > 2, which is impossible. Another way of 
looking at the inequality is to observe that for (x + 2)(« — 2) to be nega- 
tive, one factor must be positive and the other negative, which yields the 
same result. 


(k) Solve x2 — 1 < Ointhis way. (1) Solvexz? —4> 0. 


The following theorems are helpful in solving quadratic inequalities. 


(22) a>de[a>O9Ab>D0VaA<D0AS< OO), 
(23) ab<O0el[a>OAb<0Va<0A5>O))). 
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Use (22) and (23) to solve: (τι) 2:2 -- 1 <0, (n) x? > 17, 
ὅχ τ 2 «0, (p) 274+ 38r+1>0, (q) ὦ — 3). - 4) < 0. 


PROBLEMS 


1. Show from (10) that if a ~ 0, thena > Oora < 0. 


(0) 2? - 


2. Consolidate in a single axiom the entire assumption ending with (1) through 


(3), including all quantifiers required. 


3. Show that a+ 1 > a is a law by proving that it is logically equivalent 


tol > 0. 
4. Show that ife > 0,a+e¢> a. 
δ. Show that a — 1 < a. 


6. Interpret the following verbally, argue for their plausibility, and give 


examples. 

(24) —  ~(a < a), 

(25) α «ὃ -. αἱὸ < a), 

(26) α “ α, 

(27) (α Ξ ὃ ΔΛ ὃ « ὦ --» (α = B), 

(28) (ᾳ Ξ ὃ λ ὁ Κ ἢ -- (α “ ὦ), 

(29) (a « δ) «» (—a > —), 

(30) (α « δ) «» (a—c «Ὁ -- ὦ), 

(31) (a « 0) «» (1/a < 0), 

(32) (a > 0) «» (1/a > 0), 

(33) (a <b) V (α = 6) V (a > 6), 

(34) (ab > 0) «» (a/b > 0), 

(35) (ab > 0) > (a < δε» I/a > 1/9), 
(36) (ab < 0) > (a < b> 1/a < 1/8), 
(37) αὐ - b? > 2ab, 

(38) @>0Ab>0)> [5:55 va], 
(39) (a > 1) > (a? > a), 

(40) (0 <a < 1) > (a? < a), 

(41) O0<a<bA0D0 <e < d) — (ac < δὰ), 
(42) (a@a<bAc<d>(at+e <db4 a), 


(43) | (0< a) > [a < ὃ «» a? « 53]. 
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*7, Any sentence involving <, >, <, or => can be rewritten by using the defini- 
tions (5), (19), and (20). For example, (29) yields ὃ > a<> —b < —a. Also, 
laws concerning < and > can be constructed by combining laws of equality 
and inequality. For example, (29), (2-6-11), and (2-7-37) yield a< ὃ «» 
—a > —b. Use these and other methods to find further theorems about in- 
equalities. 

*8. The arithmetic mean of a and 6 is (a+ 6)/2, and the geometric mean is 
V/ab. State (38) verbally. Prove (87) by showing that it is equivalent to 
(a — b)* > 0. Prove (88) by a substitution in (37). 


In Problems 9 through 12 identify the set. 


(2.2 — 2 < 09). 
(222 — 22 - 4» 0). 
(2 (ὦ — 7) - 4) < 0). 
(2 x(x + 1).,σ — 2) = 0). 


13. Show that (21) is equivalent to the statement that no negative number 
has a real square root. 


Problems 14 through 17 refer to the discriminant of a quadratic equation and 
the quadratic formula discussed in Section 2-8. 


14. Without solving, determine whether the following have real roots: 
2—7+1=0,277+5+1 = 0,27 — 10. -ἰ 25 = 0. 

15. For what values of k are the roots of x7 -+ 2x - k = 0 real? 

16. For what values of & are the roots of 2kx? + + — 1 = 0 not real? 

17. For what values of & are the roots of x27 -+ kz - 2 = 0 real? 


*18. Show that if 42 > Καὶ B? > Καὶ A > 0,and B > 0, then AB > K. 
*19. Show that 0 <a <bBA2x>O0—-a/b < (a+ 2)/(+ 2) A (a4+2) 
(+2) «1. | 

*20. Suppose that Y = C+ 8, when Y = national income, C = consump- 
tion, and S = savings. Show that with national income constant, a decrease 
in savings means an increase in consumption. 

#21. Proove0 <v «1-9»51-οὺυ <1— 2%. 


b 
*22. P cee 5) ὑπ λὲς, ee 
pa oa? FFG (ae a) 


%* 23. Prove (— ae | Ee sees Big 1) — (ky — ke < rig < ki + ke). 


24. Under what conditions is 4.360 > 2.650? 
#25. Prove that (ax + by)? < (a? + b?)(x? + y?). 

26. Solve for z:ax — 2+ (2 -- αὐ <a+a-+ (ὦ — a). 

3 2 

27. Solve = =r 4< πὶ 

28. “The square, divided by 10, of any number larger than 0 and smaller 
than 10... is a positive quantity smaller than the number itself.” (“Predicting 
Supreme Court Decisions Mathematically,” by Fred Kort, in the American 
Political Science Review, Vol. LI, No. 1, March, 1957.) Prove this. 
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ANSWERS TO EXERCISES 

(a) (a < 0) = (0 — a) € Ret = —aE Ret. (Ὁ) From (6) and the 
axioms. (d) (a « 6)< (b — a) Ε Ret (ce —c) + (ὃ — a) Ε Reto 
b-+e— (ate) € Re+oat+e<b+e. (ὃ For example, (11): If α is to 
the right of the origin, —a is to the left. (g) False for anym <0;m>0—- 
(mz - δ <0 2 < —b/m) Am 40 -» (ma+b < 002 « —b/m). 

(h) x < —8/7. (i) a < δ: the point a coincides with or is to the left of δ. 
Q)z=O0>2° πο; >0>2?>0. (Ὁ @+1I@—1)<00e0 
(ἡ 1) ΚῸΟΛ ἃ -- ΤῈ  ΟἹΧν [ ἘΣ ΟΛ @—-1) <0 
z@S-1lA#e#S>YV@>S-1lAzr<lo-i<2z< 1; and the graph 
of «7 -- 1 “0 is the closed interval (-- 1. (Ὁ a <—2V2> 2. 


(m) Je «« «τς (an) es —V1I7 V . Ὁ VI7. (0) —2 <2 « —1., 
(p) : < (-3 — VY5)/2 Va> (—3 + ν δ) 2. (q) -4 <2 <3. 


ANSWERS TO PROBLEMS 


1. From (2-2-11), (2-7-48), and other laws of logic. 3. (15)(a:0, b:1, c:a). 
5. (15)(a:—1, 6:0, c:a). 9. (—-VW2__V2). 11. (=4_7). 18. Ve [zx © Re > 
a? > 0) = ~ar~ [x Ε Re > x? > Ὁ] = wae [re Re A 22 < 0]. 

15. [522 --- ἀκ > 0) = {k|k -1<0} Ξ(--ο ἡ. 17.8 < —V85V 
k> V8. 


3-6 Operations on sets. We consider a universe of discourse U together 
with all its subsets. If A is the subset, the complement of A, A’ (read 
“A prime”), is the set of all elements of U that do not belong to A. 
Formally, 


(1) Def. A’ = {z|~(a € A)}. 


For example, if U = {0, 2, 4, 6, 8, 10}, {0,4,8}’ = {2, 6, 10}. 
(a) With the same U, {2,4,6}’ = ? (Ὁ) Justify (2) and (3). 


(2) xE A’ = «(x € A), 
(3) {ul f(x) }" = {x[~f(a)}. 


If A and B are subsets, their intersection, A τ B, is the set of elements 
of U that belong to both of them. 


(4) Def. ANB= {rjtEAAxEB}. 


For example, {0, 2, 6,8} M (2, 8, 10} = {2,8}. 
(c) {a,b, c,d} M {a,c,e,f} = ? (da) Justify (5) and (6). 


152 ELEMENTARY THEORY OF SETS [CHAP. 3 


(83) te (ANB) =xEAAzeB, 
(6) {x|f(x)} ἢ (τ Ἰσ(}} = [ἡ fe) A φᾷ()}ὃ. 


If A and B are subsets, their union, A U B, is the set of elements that 
belong to either or both of them. 


(7) Def. AUB= {xlxEA V wEB}. 


For example, {0, 2, 6,8} U {2,8, 10} = {0, 2, 6, 8, 10}. 
(6) {a,b, c,d} U {a,c,e,f} = ? (ἢ Justify (8) and (9). 


(8) rE(AUB =xEAVaecB, 
(9) {x|f(a)} ὦ {xl g(x)} = [1 7) V g()}. 


(6) The symbols M and U are sometimes read “cap” and “cup.” Suggest a 
reason for the choice of these symbols. 


The intersection, union, and complement may be conveniently visualized 
geometrically if we think of the universe of discourse as a region in a plane 
and the other sets as regions within it. In Fig. 3-3 we show the intersection 
as the region common to A and B, the union as the region consisting of 
those points in either or both A and B, and the complement as the region 
outside A. Note that A’ is not everything not in A, but everything in U 
that is not in A. Diagrams like Fig. 3-3 are called Venn diagrams. They 
are useful aids to the imagination; however, like the truth tables in Chap- 
ter 2, they are not to be confused with proofs. 


S525 
RK 


AUB 


ee 


FIGURE 3-4 
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A special kind of Venn diagram, due to Lewis Carroll, is illustrated in 
Fig. 3-4. It is very convenient for visualizing the subsets formed from a 
universe by two sets. 


(h) Copy the diagram for A M B in Fig. 3-4 and label the three unshaded 
Squares properly, using the following labels: A’ ἢ B’, A’ B, AN Β΄. Ona 
similar figure,shade (i) A’ UB’, (4) AUB, (k) AUB’, (1) (ANB). 
(m) Repeat Exercises (i) through (1) using a Venn diagram like that of Fig. 3-3. 

Let U = the human race, A = the residents of New York City, and B = 
the citizens of the U.S. A. Express the following symbolically and draw Venn 
diagrams: (n) the American citizens who live in N. Y., (o) aliens, 
(p) people who live outside N. Y., (4) citizens who do not live in N. Y., 
(r) the set of all people, (8) residents of N. Y. together with all aliens. 

With symbols defined as in the preceding paragraph, express in words: 
(t) ATUB, (u) ANB, (v) AUB, (w) ἡ! ἢ 8’, (x) A’U B’. 


Complements, unions, and intersections of intervals are very conven- 
ient for specifying the solutions of inequalities. For example, we found 
in Section 3-5 that {χώ 1 — 4 < 0} = (—2_ 2). Then 


(10) {a]z? — 4 > 0) = (—2_2)’ = {χΧ1 < —2 Vx > 2} 


(11) = {r/x < —2} U {x]z2 > 2} 


(12) = (πο -ὃὸ7.ν (2 οο). 


By combining intervals we can get a very large variety of sets of points 
on the axis of real numbers. Among these sets are those consisting of sin- 
gle points, since {x} = (a_x) ἢ (x_b) for any a < ὃ. Thus we may 
think of a set consisting of a single point as a “degenerate” interval. We 
might even write {x} = (x_ 2). 


Find ἃ simple expression for and sketch the following sets: (y) (20), 
(z) (83_5) NM (4_6), (7) (38__5) U (4 6). 


PROBLEMS 


An insurance company with policyholders U is studying characteristics of 
certain kinds of policyholders. Let A = adult policyholders, B = male policy- 
holders, C = married policyholders. Describe in words: 


| a 2. Β΄. 9. Og 

4 ANB. 5. AUB. | 6. BNC’. 

7. Β΄ σ᾿ Ο΄. 8. Β΄ ι) σ΄. 9. AN(BNOC). 
10. (AN BNC. ΤΊΣ; 12. A’ ἡ (Β Oo): 
18. Α΄ (ΒΩ Ο. 14. (BoNONA. 15. (AU BY. 


16. BU δ΄. 
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Using three circles, draw Venn diagrams of the following: 
17. AN (BNC). 18. AU(BUC). 19. (AN B’)’. 
20. (AUB) NC. 21.BU(CN A’). 22. A’N (BNC). 


* 23. Suggest ways in which diagrams of the type of Fig. 3-4 could be adapted 
to representing three or more sets. 


Sketch: 

24. (2—e 2+) wheree >0. 25. (8. 4) U (8 6). 

26. (3_4) MN (4 ὅ). 27. (—1_3) U (0 2). 

28. (--1__3) U ὦ 5). 29. (—1_3) N (ὦ 5). 

30. (—1__3) MN (O__2). 31. {1} U 4). 

Show that 

32. (—2%__b)’ = (Ὁ 9). 33. (ὃ οο)' = (-τοο δ). 


84. (a__b)’ = (--ὸ α)ὼ (ὃ οὐ). 35. (a__b)’ = (-τ-οὦ α)ὺ (ὁ 0). 
86. (a_b)’ = ς--οὦ α)ὺὼυ (ὃ “0). 487. (α δ)' = (-τ-ο αλὺ (ὃ ΤΟ). 
38. (a α) = {a}. 39. (a_a) = ὦ. 

40. α >b— [(a_b) = QI. 

41. (a b)M (ce _d) = (c_b) is not a law. 


For Problems 42 through 45, let A U B be the elements in A or B, but not 
both. 

% 42. Give a formal definition. 

*43. Draw a Venn diagram and a Lewis Carroll type. 

*44, Show that AU B = (AUB) N (AN By’. 

*45. Under what conditions is A UB = AUB? 


The relative complement B — A of A with respect to B is defined as the 
set of elements of B that are not in A. 


*46. Give a formal definition. 
*47. Show that the complement of a set is its relative complement with re- 
spect to the universe of discourse. 


Express 48 through 55 in terms of intervals. 


48. {x| (x + 1)(ς — 3) 9 0]. 49. {z|(x-+ 1) — 3) < O}. 
50. {222 — 3 < 0). 51. {2122 — ὃ > 0). 

52. (212 -Ὁ 52 — 6 > 0). 53. fala? - a+ 2»). 

54. fxla?+ 224 5 < 0]. 55. {a|x22-+ 22+ 1 < 0}. 


ANSWERS TO EXERICSES 


(a) {0,8,10}. (b) (3-3-3). (ὦ {a,c}. (d) (8-3-3). (e) ἴα, ὃ, ὁ, ἃ, 6,7}: 
(f) (8-8-3). (6) Similarity to A and V in terms of which they are defined. 
(h) See Fig. 3-5. (i) Fig. 3-6. (j) Fig. 3-7. (k) Fig. 3-8. () Same 
as Exercise (ἡ. (n) ANB. (0) BY. (p) A. (@) A’NB. (ὃν. 
(s) AUB. (ὃ Citizens who do not live in N. Y. (ὦ Citizens who live in 
N.Y. (Ὁ) Those who are citizens or who live in N. Y., i.e., citizens and 
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΄ 


FIGURE 3-5 Figure 3-6 FIGURE 3-7 FIGURE 3-8 


New Yorkers. (w) Aliens who live outside N.Y. (x) Those who live out- 
side N. Y. or who are aliens. (y) (-τ-ὸ 2). (52) ( 5). (4) (8 6). 


ANSWERS TO PROBLEMS 


1. Minor policyholders. 8. Unmarried policyholders. 5. All adults and 
all males. 7. Unmarried females. 9. Married adult males. 11. Empty 
set. 13. Minors and all married males. 15. Those who are neither adults 
nor males, 1.e., the minor females. 25. (3 6). 27. (—1_8). 28. (—1_ 5). 
31. A point and an interval. 32 through 41. Use definitions from Section 3-4. 
42. (1 ΞΕ AV 2:6 Β). 45. ΑΓΒ -- Ὁ. 44.B—A = ANB. 

40. (—1_ 3). 51. (~~ __—+/3) U(V3__~). δ8. (--ο οὐ). 55. @. 


*3-7 Algebra of sets. Since operations on sets are defined in terms of 
_ the corresponding logical operations, we might expect that the sets would 
satisfy laws similar to those of logic. 


(1) (A UB)’ = Α’ NB’ 
: (De Morgan’s laws), 
(2) (A NB)’ = A’UB | 
(3) AUB=BUA 
(4) Εν (Commutative laws), 
b= N 


(5) AU(BUC)=(AUB)UC 
6) AN(BNC)=(ANBNC 
(7) AU(BNC)=(AUB)N(AUC) 
(83) AN(BUC)=(ANB)U(ANC) 


(Associative laws), 


(Distributive laws), 


(9) AUA=A 

(Idempotent laws), 
(10) ANA=A 
(11) (4) =A (Involution law). 


Diagrams are helpful in appreciating these and other laws. By making 
separate diagrams for each side, we can see that different procedures are 
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RELA 
R504 


no ζζζξλ Αι β V/A Al 
SQ (4 uy BY QA 8 
[κου Α΄ ἡ δ’ 
FiGURE 3-9 
A A Ae 
ἘΠ Ζ2 
B ΄.- Β 
ῥα ας ἘΠ 
β΄ Β΄ 
QW 4 LLZA ANB An(BuC) 


VA/ ZA Bot 


SN Αἱ ἡ (Οἱ 


FiacureE 3-10 


(An By) u(AnC) 


involved and that the final result is the same. Different kinds of shading 
should be used for each set that is of importance, and the sets should be 
clearly labeled. Figure 3~9 shows Lewis Carroll diagrams of the two sides 
of (1). In Fig. 3-10 we sketch the sides of (8), letting the central square 


represent C. 


(a) Let U 
AN(BNC) 


people, 


«ἢ ἢ 


A = students, 8 
students who are both male and female, and A U (BUC) = 


males, C = females, so that 


those who are either students or men or women. State each law above in words 


in terms of these sets. 


The proofs of the above laws are easy. 


One simply applies the defini- 


tions and then the identities of Section 2-5. For example, 


(12) Proof of (3): 
(a) AUB= 
(8) = 


{e4jxEAVxEB} 


(8-6-7) 


{fzltEBVxreEed} 


pV q=¢V p)(pix € A, git € B) 


(Y) BUA 


(3-6-7) (A :B, B:A). 
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(0) What rules of proof were used in step (6)? (ο) in step (Y)? 


One can also argue informally for these identities by showing, as called 
for by (8-3-4), that every element of one set belongs to the second, and 
conversely. ‘To argue for (8) in this way we suppose that an element be- 
longs to A U B. Then it must belong to A or to B or both. But then it 
must belong to B or to A or to both, and hence to B U A. Similarly, be- 
ginning with an element belonging to B U A, we argue that it must belong 
to A UB. Of course this is merely an informal paraphrase of (12) with 
the logical identities remaining implicit. 


(d) Argue informally for (1) and prove it. 
(13) Def. AUBUC=AU(BUO, 
(14) Def. ANBNC=AnN(BNO, 


(15) AU@= A, 
(16) Α ὦ ὁ = @, 
(7) AUU =U. 
(18) AnU= A, 
(19) AUA'=U, 
(20) AN A’=@Q, 
(21) U' = Q, 
(22) (' = U, 
(23) (A = B’) @ (Δ’ = B), 
(24) A=(ANB)U(ANB), 
(25) AUB= (ANB) U(A’N Β)  (Α NB), 
(26) = (AN B)U(A'’NB)U(A NB) VU (A' NB), 


(27) Αυ (Α ἡ Β) -- Α, 
(28) AN(AUB) = A. 


(6) Argue for the above from diagrams and informally from the meanings 
of the terms. (f) Prove (15). (4) Prove (23). 

Simplify: (h) BN(AU 4’), ὦ AU(CUC’), ᾧ (SNTIN(TNA 7, 
(kk) CNCON CNC), d) BUQU (BNC), (m) (AUBUC)U 
(AN BNC), ἡ ὕω ἃ, (ἡ UU®D, ῳ) (BNC U(BNC’. 

(ᾳ) SAT)NUSNATIUSAT), (ὁ A’U(BNCY. 
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Using a diagram like Fig. 3-10, sketch each of the following: (8) 4΄ ἢ 
(BUC) tH) ANBNG WANBNC, Ww) AUBUC, 
(w) ANB nec, (x) AU(B'NC’, 0) the set in Exercise (r). 


PROBLEMS 


In the introduction to a recent dictionary there appears a diagram similar 
to the one in Fig. 3-11, in which X = formal literary English, Y = colloquial 
English, and Z = illiterate English. 


Y 


(oN 


Figure 3-11 


1. Name a,b,c, d,e,f,aUb, bUc, an X, and bUd Ue in terms of X, 
Y, and Z and verbally. 

2. What assumption is implicit in the diagram? Express it in symbols and 
words. 


3. For A, B, and C defined as in Problem 4, find the sets of Exercises (s) 
through (y). 

4. Illustrate the laws of this section for U = {1, 2,3, 4,5}, A = {1, 2, 3}, © 

= [2, 8, 4),  -- {2, 4, 5}. 

5. Find the complement of (A’ U B’) N (A’ UC’). 

6. Prove some of the laws of this section. 

7. Derive (26) by applying the distributive law to the right member. 
#8. Discover and prove some laws involving the relative complement. 
*9. Do the same for A U B. 

%10. Interpret the following definitions: 


(20) Def. US = {z|yyyESA Ey}, 
(30) Def. NS = {x|\vyyES—uxey}. 


*11. Show that U{4, B} = AUB, andN{A, B} = ANB. 

*12. Let K = the set of points in a plane within and on the boundary of a 
square. Let S consist of all circular regions that include K. Argue that Ns = K. 
*13. Let C = the set of all points in a plane within and on the circumference 
of a circle. Let T be the set of all polygons inscribed in the circle. Argue that 
UT - 6. 

14. Aristotle expressed the fundamental laws of logic as follows: (1) All A 
is A. (2) Nothing is both A and not A. (3) Everything is either A or not A. 
Express these laws in set terminology. 

15. Prove (28). 
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ANSWERS TO EXERCISES 


(a) (1) Those who are not either students or males = those who are non- 
students and nonmales. (2) Those who are not male students = those who are 
either not students or not males. (3) Those who are either students or males = 
those who are either males or students. (4) Student males = male students. 
(5) Those who are either students, or males or females = those who are students 
or males, or females. An alternative: Students together with males and fe- 
males = students and males together with females. (6) Students who are males 
and females = student males who are females. (7) Students together with 
the male females = those who are both student males and student females. 
(8) Students who are males or females. (9) Students and students = students. 
(10) Students who are students = students. (11) Those who are not non- 
students = students. Note: Either “and” or “or” may be involved in a correct 
translation of U. Thus A U B = the A’s and the B’s = those who are in A 
or in B = those who are in A and those who are in B. But AM B = the A’s 
in B = the B’s in A = those who are in A and in B. It is best not to rely on 
rules or memory, but to understand the meaning and translate into the best 
English in the particular context. 

(Ὁ) Rule of Substitution in pV q=¢qVp to gt re AVXEB = 
x€&BV «x€A, then Rule of Replacement to insert the second for the first. 
(c) Rule of Substitution in (8-6-7), then Rule of Replacement, to replace the 
right member of (8) by (7). _(d) If an element belongs to (A U BY’, it does 
not belong to A U B. Hence it cannot belong to A or to B. Hence it does not 
belong to A and it does not belong to B, that is, it belongs to A’ Β΄. The 
converse is shown similarly. The proof goes as follows: (A U B)’= 
{a[z € (A U B)’} = {r]~e@eAvU B)} = {zrj~(xE AV 18 β))}- 
{z|~(eE A) A ~(e Β)} = {z|2E A’ A ce B’} = [1[: 6 (A'N B’)} = 
A’) Β΄. An alternative way of working the proof is to write x Ε (A U B)’ = 
~(xE€ (AU B)) =... = 2 € (A’N B’, and then use (3-3-4). 

(e) For example, (15) says that the objects that belong to a set or to the null 
set are just those that belong to the set. (f) AU@ = {zjrE AVrE MD}. 
By (3-3-2), “(ας Ε M), and by (2-7-3831) ~(t@ Ε 6) > [(txE AV χε 6) «-» 
(cx € A)]. Hence by the Rule of Inference (x € A V x EM) (rE A). 
Hence {z|x€ A V x EO} = {x|xz © A} by (3-3-6), and {jx A} = A. 
(g) By hypothesis 4 = B’. Hence A’ = (B’)’. But (B’)’ = B by (11)(A:B). 
Hence A = B’ —> A’ = B. The converse is proved similarly. (h) B. (i) U. 
ὁ), ὁ. (kK) @ (@) U. αὖ ἡ. (0) ὁ. (p) B. (q) SUT. 
(ἡ) (ANBNCY. 


ANSWERS TO PROBLEMS 


lLa=XNYnNZ,b= XNYNZ,c=XNZd=XNYnZ' 
e=X’NYNZ,f = X'NY'NZ,aUb =XNZ',bUc = XNY,anX =a, 
bUdUe = YN(XNZ)’. 2. XN Y'NZ = @. There is no formal literary 
illiterate English that is not colloquial; i.e., all formal literary English that 
is illiterate is also colloquial. 3. (s) {4,5}. (t) {2}. (u) @. (v) {2}”. (w) @. 
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(x) {1, 2,3}. (y) {1, 3, 4, 5}. 5. AN (BUC). 7. Right member = 
[AN (BUB)Ul[A’N (BU ΒΒ = (ANU U(A'NU) = AU A = U. 
15. AN (AUB) = (AUBN(AUB) = AU@NB) = AUD = A. 


*3-8 Relations between sets. In Section 3—4 we defined and illustrated 
briefly the relation of inclusion. With the aid of set algebra we can an- 
alyze inclusion further and consider some other possible relations between 
sets. 

From the definition (8-4-1) we have immediately 


(1) ACA, 
(2) [ACBABCC)- ACC, 
(3) QCA, 
(4) ACU: 


From (3-3-4), | 
(5) [ACBABCA]—-A=B. 


Additional laws are 


(6) (ANB) CA, 

(7) ACc({(A UB), 

(8) ACBeB'CcA’, 

(9) ACB—>(ANC)C(BNQ), 
(10) ACB> (AUC) ς« (Βυ Ο), 
(11) ACBe{AnB= Al], 

(12) ACBe[AUB= BI, 

(13) AGC(BNC)e[ACBA ACA, 
(14) (AUB) CCe[ACCA BEA, 
(15) [ACBA~(ACO]—-~~(BCO), 
(16) ACBO[ANB = Ql. 


These theorems are intuitively evident if they are expressed verbally 
or illustrated by Venn diagrams. They are also easy to prove by simply 
using the definitions and applying theorems from Chapter 2. For ex- 
ample, to prove (6), we assume by hypothesis that x € AN B. Then 
reA A XEB. But by the law [p A 4] — p and the Rule of Inference 
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it follows that x € A. Hence Vz (x € A N B — 2 € A), which is equiva- 
lent to (6) by the definition of inclusion. Or we may argue informally 
that if an element hes in both A and B, it must lie in A, and hence every 
member of A M Bis a member of A. ; 


(a) Argue informally for the above laws in terms of Venn diagrams and the 
meaning of terms. (b) Prove (7). 


To prove (15) we note that it is equivalent to (2) by the identity 
(pA 4) -Ὁ 7] ξε [ῷ An~r) > "αὶ 


with 
(pACB q@BcC,r:A CC). 


(c) Prove the logical identity (2-5-35). (d) Make the substitution (A:C, 
B:A,C:B) in (2) and then prove (17). 


(17) [ACB AAW~(CCB)]->~(C CA). 


To show the plausibility of (16) we note that A is a subset of B if and 
only if every member of A is a member of B, that is, if and only if no mem- 
bers of A are in B’, that is, A ὦ B’ = @. 

To prove it, we write 

ACB=  We(tx EA 2x EB) 
= [~Jr μι € Α -- χα Ε Β)[)} 
[- ΞΟ EA A σε ΒΒ] 
= [~Jr2 ΕἘ(4 ἢ B’)] «» [A NB = 9]. 


| 


(6) Justify each step in the preceding argument. 


If two sets have no members in common we say that they are disjoint. 
This relation may be symbolized by A // B, the notation being suggested 
by the fact that two straight lines in a plane are disjoint if and only if 
they are parallel. 


(18) Def. A//B=[ANB= QI. 

(f) Draw a Venn diagram showing two disjoint sets. (g) Show 
(19) A//B>B//A. 

(h) Prove (20). 


(20) A//BoOACB’. 
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(i) Prove that A//Bo BC A’. 

(21) [A//BN A/S/C]) RP A//(BUC). 
() Show that [A//B A Β΄ — A//C is not ἃ law. 


We define two further relations, leaving their properties to be worked 
out by the reader. 


(22) Def. ACB=|AGCBAAF Bl. 
When A C B we say that A is a proper subset of B. 
(k) Give an example for which AC BA ACB. 
(23) Def. AYB=-~[ACBVBCAVA//B\. 


When A { B we say that A overlaps B. 


(1) Draw a Venn diagram illustrating A { B and several examples in which 
~(A { B).  (m) Illustrate the definitions and theorems of this section using 
U = {1, 2,3, 4, 5} and its subsets. 


PROBLEMS 


1. Complete the reasons in the following proofs and supply informal argu- 
ments for the laws. 


(24) Proof of (2): 
(a) rE AOLEA (3), 


(b) ACA (3-4-1) (B:A). 
(25) Proof of (8): | 
| (2) ACBA BCC Hyp, 
(bt) @@E A>2rEB)A (@EB>xEC) (1), (2-10-14), 
(cc) cE 4 “ χεῦΟ (9), 
(4) ACC (ἢ), 
(e) (3) Q.E.D., (a), (d). 


2. Prove (8) by using (2—5—28). 

3. Prove (9) by using (2—7-25). 

4. Prove (10) by using (2-7-11). 

5. Prove (11) by noting that AMBCA by (6), and proving that 
ACANQB from the hypothesis A € B and (9)(C:A). Why is this sufficient? 

6. Prove (12) from (7) and (10). 
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7. Prove that AC Bo A//B’. 
8. Prove (26). 


(26) ACBe[ACBV A = Bl. 


9. Discover, argue for, and prove other laws that involve proper inclu- 
sion. 
10. Prove (27). 


(27) [A//B A CC ΑἹ > C//B. 


11. Prove that the sets in the right member of (3-7-25) are disjoint two by 
two. 

12. Do the same for (3-7—-26). 

13. Prove (28). 


(28) AXYB—BYA. 


14. Do Problem 9 for overlapping. 
15. Show that JA 3B A//B A ACB! 
16. Show that A = BV ACBVBCAV A//BV AX Bis nota law. 
Show that it is a law if the hypothesis 4 ~* @ A B Σέ @ is added. 
*17. Prove [{zla< 2< δ) = {alec x<@lro@=cAb= a). 
*18. Prove (2 α < “2 « δ) γέ {2la < , < DB}. 
*19. Prove {z|1 < 2 < 2} ς {x]1.5 < x < 7}. 


Under what conditions does 


*20. (a_b) ¥ (c_d)? 
*21. (a_b)//(c_d)? 
#22. (a_b) C (ς ὦ) 


*23. Prove AGMA = @. 

*24. Consider the following sets of figures in plane geometry: T = triangles, 
O = obtuse triangles, R = right triangles, E = equilateral triangles, S = 
scalene triangles, J = isosceles triangles, A = acute triangles. What relation 
holds between each pair? (Pair each one in order with every following one.) 
*25. ProvejzjyxeEyAxcy. 

*26. Proove[AUX = UA AN X = 6] > [X = A’). 


ANSWERS TO EXERCISES 


(a) For example, (7) is plausible, since any member of A is certainly in A 
or in B. Again, (8) is true because if every element of A is in B, any element 
not in B cannot be in A. (b) tEAr>MTEAVXEB—>2xE(AUB) 
by(p>pV φίῳ:ι Ε Α, 4: EB). (Ὁ ΙΪῷ ΛΔ φ) -οῦ Ξ-ίνυῳ Δ ὧν τ = 
~pV~dV τὶ (pA Ὁ) το ἢ] Ξίψῷ Δ ar) V+) = ~pV~erVn~gs= 
~pVer~gVr. (d) It yields CC AA ACB—-CCB; then by (2-5-35) 
(p:A © B,q:A © B,r:C ς B) we have (17). 

(e) (3-4-1); (2-10-38) (f(a) :~(¢ € A — 2. B); (2-5-24); (3-6-4); (3-3-8). 
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Θ 


AtB 
FicgurE 3-12 FIGURE 3-13 


(f) See Fig. 3-12. (g) Since[AN B = 9] = [BN A = 0]. (h) (16), (18). 
(i) (19), (20). (j) (4:1, 2}, B:(3, 4}, 6:[1,2})}. (ἡ (A2{1, 2}, B:{1, 2, 3}). 
(1) See Fig. 3-13. 


ANSWERS TO PROBLEMS 


1. (24a) pop (250) [( -- 4) ΛΔ (ᾳ --Ὁ 7)] -Κ (por). (25d) (3-4-1). 
Note that quantifiers are omitted. 3. (ρ::Ὲ Α, φιῖχ Ε Β, τῖχ EC). ὅ. (δ). 
7. ACB ANB - ( κε» A//B’ by (16) and (18). 9. Some examples: 
~(AC A) ACBoO*(BC A) ACBABCCAACE. 11. Any 
product of two of these sets has AM A’ or BM B’ as a factor. 13. (23). 
15. (4:0, B:@). 21.6 <cAac<d. 


3-9 Descriptions. It is often convenient to refer to something as the 
object that satisfies a certain condition. For example, “the set that has 
no members” is a description of the empty set. Such descriptions take 
the form “the x such that... ,” where the dots stand for some sentence 
involving x that is satisfied by one and only one value of x. We introduce 
the symbol 9 to stand for “such that” and adopt the formula x 3 f(x) to 
stand for the unique solution of f(z). For example, {x 9 2x = 1] = 1/2. 
We read x 9 f(x) as “the x such that f(x).” Expressions of this form have 
meaning if and only if there is one and only one solution of the condition 
f(x), that is, if and only if 3!z f(x). The essential property of this undefined 
formula is given by the following axiom. 


(1) Ax. [a = 2 3 f(x)] = (f(a) A Wy fy) ry = αἱ. 


(a) Read (1) in words. (b) Why is “xr” in “x2 f(z)” adummy? (c) Justify 
a = (x32 =a). (d) Comment on “x92? = 1.” Justify the following: 
(e) 1 = (xa9Vz2a2 =2), (ὃ O= (raVyytur=y), (8) a/b = (x9 xb =A), 
(8) α --ὃ = (#ax+b=a), (i) {zl f(z)} = SavrelzxeS< f(x)]. 


We illustrate the utility of the description notation by using it to 
formulate several definitions. 

Given a set of real numbers S, we say that χ is the maximum of S if x 
is in S and is larger than any other member of S. Similarly we call x the 
minimum of S if it is in S and is smaller than any other member of S. 


3-9] DESCRIPTIONS 163 


Geometrically the maximum of S is the point in S farthest to the right, 
and the minimum is the point farthest to the left. 


(2) Def. maxS=azareS A Vyl[yeS-u2z 


IV 


yl, 
(3) Def. mnS=zvarEeSAVylyeSoux< yj. 


Find the following: (j) max {1,2,3}; (k) max {5}; (1) min {4, —3, 8}; 
(m) max {—3,3}; (ὦ) min {z|2? < 5}. 


In Section 1-4 the absolute value of x, symbolized by |x|, was intro- 
duced intuitively as the length of the vector x. In (1-13-17) we defined 
it by considering separately the cases x > 0 and « < 0. We now can 
give a simpler definition. 


(4) Def. lz] = max {x, -- οἱ. 


(0) Use (4) to find |0|, |—3}, [10]. 


We can now easily prove the following, which are equivalent to (1-13-17). 


(5) zr>0< |2| = 2, 
(6) z<i0< [2] = —2. 


The reader should check that the definition is consistent with the intui- 
tive concept. Recalling from Section 1-4 that b — a is interpreted as 
the vector from a to ὃ or as the directed distance from a to ὃ, we see that 
δ — αἰ is the length of this vector. Accordingly we describe |b — αἱ as 
the undirected distance between a and b. In particular, |x|, which is equal 
to |x — Oj, is the undirected distance between the origin and the point z. 
These geometric interpretations are very helpful in dealing with absolute 
values. 

The following laws are evident from the geometric interpretation. They 
can, of course, be proved from (4). 


(7) Ια] > 0, 

(8) ([α| = 0) — (a = 0), 

(9) Ια] = |—al, 

(10) (lal = [8]) 9 @=b Va= --ὃ), 
(11) ja+ ὃ] < fal + [6], 


(12) ja} — [bf < fa + 6], 
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(13) jab] = fal - |b], 
(14) max {a,b} = (1/2)[a + ὃ + |b — aj], 
(15) min {a,b} = (1/2)[a + ὃ — |b — al]. 

The most useful law for solving equations involving absolute values 
is (10). Thus (la — 1] = 8, «» (Σ -- 1 -ῖξ - ὃν. ---ῚΤ Ξξ --ὃ) «» (ὦ -Ξ 
4 V καὶ = —2). A still easier way to solve this equation 15 to note that it 


requires that x lie at an undirected distance of 3 from 1. Hence it must 
lie at 4 or —2! We sketch this in Fig. 3-14. 


Solve each of the following and sketch as in Fig. 3-14 {note that x-+ 2 = 
x — (—2)]: (p) lz -- 8. = 1, (q) |4—2| = 3, (Ὁ [2+2| = 1. 


FIGURE 3-14 


Solving inequalities involving absolute values 15 most easily done from 
the geometric interpretation. Thus to solve | — 8] < 2, we note that it 
requires x to lie within an undirected distance of 2 from 3. Hence it lies 
between 1 and 5 inclusive, i.e., in the closed interval (1 5), which is the 
solution. Similarly, the solution of |x — 3| > 2 is (—#__1) U(5__0). 
Also, since 

jx + 3] < 13 |e — (—3)| 1.» --άἀ “χ < --2, 
then 
(x| |e + 81 < 1} = (-4_—2). 


Solve and sketch the solution for: (8) |z| < 3, (t) |z| => 8, 
(u) [2 — δ < 3, (v) |e — 8] > 2. 


The following laws generalize the results of the previous exercises. 


(16) {x| |x| « δ) = (--ὸ ὃ), 

(17) {a| |x| > b} = (—w__—b) U (b_), 

(18) (2| jc — αἱ « b} = (a — b_a-+ ὃ), 

(19) {x| |x — al > b} = (-w__a — ὃ) U (a+ ὃ ο). 


Use absolute values to write brief defining sentences for: (w) (—e_e), 


(x) (3 — 6_8+ 4), (y) ὦ 4), (0) @— ὃ α- dy. 
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PROBLEMS 


1. In the light of Exercise (g), explain briefly why “a/0” is meaningless. 

2. Comment on “Whenever they disagree, I choose the version which, in 
my judgment, is the more credible and at the same time the more interesting of 
the two.” 

3. Show that [a = x 9 f(x)] = [{a} = {x|f(x)}]. 

4. Comment on “max {x|z « 1}.” | 

5. Let Τά) = the truth value of xz. Show that T(p V 4) = 
max {T(p), T(q)} and T(p A 4) = min {T(p), T(qQ)}. 

6. What is the range of significance of “S” in “max S”? 

7. Cite several sets S such that max S or min S does not exist. 


Solve and sketch: 


8. [24] = 4. | 9. |2a - 3] = 7. 
10. [1 = —1. 11. [1| = 0. 

12. [1.  4{ < 1. 13. ja + 4| > 5. 
14. |x — 3] > —2. 15. [22 — 1] < 2. 
16. [2 — 5| < 0.01: 17. |z — 2| < 0.02. 
18. [5 -᾿ 0.01 < 5. 19. [1 — 32] = al. 


20. Find ὁ and ὃ such that (a__b) = {x||x — οἰ < 6}. 

21. Use absolute values to write brief defining sentences for (3__8), (--1 δ), 
and (2 4). 

22. Often results of scientific measurements are given in the form z = a + 6, 
meaning that x probably differs from a by at most e. Express this in each of 
the forms a < x < 8, [5 — Ὑ] < ὃ, andzE (r_s). 

23. x9aVy~ (YExu) =? 

24.z29VyyEuru=? 

25. Use a description to define \/z. (See 1-13-16.) 

~%&26. Prove (5) by proving thatz > 0 = ~ x > —2z. 

* 27. Prove (6) similarly. 

* 28. Prove (7) and (8) by brief arguments. 

*29. Prove (9) by showing that max {a, —a} = max {—a, —(—a)}. 

* 30. Prove (10) by noting its equivalence to max {a, —a} = max {b, —b} << 
a=6bVa= --ῦ. 

*31. Writing (11) in the form |a — (—b)| < [αἱ - [Ὁ], interpret it geo- 
metrically, sketch, and argue for its plausibility. 

*32. Prove (11) by considering the cases a > 0, b> 0; a <0, ὃ «0; 
a>0,6 < 0;anda «0, ὃ » 0. 

*33. Treat (12) according to Problems 31 and 32. 

* 34. Symbolize by a description “the greatest good for the greatest number.” 
Does it exist? 


ANSWERS TO EXERCISES 


(a) “ais the x such that f of x” means that f of a is true, and if f of y is true, 
then y equals a; that is, a is the one and only solution of f(z). (b) x 3 f(x) 
stands for the solution for the variable x of the sentence f(x). Without a vari- 
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able present, this would make no sense. Hence x cannot be replaced by a 
constant. (c) By (1), since (a =a) AVyl@ =aAy =a) (ὦ Ξ y)). 
(d) Nonsense, since the sentence has two solutions. (6) Vel-z=2z2A 
Vy [Vzyz = 2z2—-y = 1). (ἢ) To avoid confusion due to y appearing in two 
quantifiers, we substitute (y:z) here to get 0 = (3 Vzz+2= 2). Then 
VzztO=2z2A VyWezety =z7y = Ol. (g) (a/b)b =a A Vyyb = 
a—>y = a/b. Note that this holds only if ὦ ¥ 0. (i) [x € {x|f(xz)} > 
fa) A Wy [Wea Ey f@)) > y = (2]7}}}.. 

j) 3. (Kk) 5s (ἡ —3. (m) 8. (n) —V5. (0) 0,3,10. (p) (2, 4}. 
(ἃ). {1,7}. Qtajhesh (ΓΞ, 4b) ee) Uo). 
(u) ῷ 85. (v) (—%_6) U(10__%).  (w) [el «ε 60) [2 — 3) « ὃ. 
(y) [5 --8.»1. (Ὁ [ze -al > δ. 


ANSWERS TO PROBLEMS 


1. χ 0 = a has no solution or more than one according to a ~ 0 ora = Q, 
as explained in Section 1-14. 3. To say that {x|f(x)} is the singleton {a} is 
to say that a is the unique solution of f(x). 5. Refer to truth tables and con- 
sider cases. 7. (2 5) has neither max nor min. 9. {2,—5]. 11. {O}. 
13. (—9_1)’. 15. (—0.5__1.5). 17. (1.98__2.02). 19. {1/4, 1/2}. 

21. |x — 5.58] « 2.5, |x — 2] < 8, χῳ -- 8, <1. 23.0. 25. yay? = 
rcANy>0. 27.2<0->-41>0-> τοῦ > α > max {z, —2} = —2. 

29. Since {—a — (—a)} = {—a,a} = {a, —a}! 31. The distance between 
—b and a is not greater than the sum of the distances between the origin and 


a and 6. 


*3-10 Sets and sentences. When we say that a number is prime, we 
are asserting that it belongs to the set of prime numbers or that it has the 
property of being an integer greater than 1 whose only factors are itself 
and 1. In this way every statement asserting set membership can be 
reformulated as a claim that the object has the property that is peculiar 
to members of the set. This correspondence between sets and properties 
is embodied in (3-3-3). Because of it we can formulate any sentence in 
terms of set membership, or, conversely, we can formulate any sentence 
about set membership in terms not explicitly involving sets. 


(a) Consider the statement “3 is odd.” Complete the following equivalent 


sentences: ᾧὌορᾶ E __ ἢ. “8. is: an integer _____.”_ (b) Consider 
the statement “He is my father.” Gomplets the following equivalent sentences: 
= ἘΞ father. a ee ee πὰ 
(c) Consider “There is an even prime number.” Complete the following 
equivalent sentences: “fol zis prime and ziseven} # —__— Ix 


Exercises (a), (b), and (c) suggest the variety of ways of expressing 
ideas in terms of properties and sets. To systematize the relations be- 
tween sets and sentences, let P = {x|p(x)},Q = {xle(x)}, and Rk = 
{x|r(x)}. Below we give a table of statements in terms of the sets and the 
corresponding synonymous statements involving the defining properties. 


3-10] SETS AND SENTENCES 169 


(1) aeP p(a), 

(2) P=Q Va p(x) <> g(a), 

(3) Ρςῳ Vx p(x) —> 4(), 

(4) P//Q Va ~ (p(x) A 4()), 

(5) ae Ρ' ~p(a), 

(6) aEPNnQ »γ(α) A ga), 

(7) aeEPUQ _ pla) V qa), 

(8) P=9Q Ve ~ p(x), or ~dx p(x), 
(9) P= 1 Va p(x). 


Translate into set terminology: (d) He is neither happy nor wealthy. 
(fH = happy people; W = wealthy people.) (e) He is healthy, wealthy, and 
wise. (S = wise people.) (ἢ) He is a happy man. (M = men.) (g) If 
Z ABC is a right angle, then AB? + BC? = 463, and conversely. (R = right 
triangle; P = triangles such that the square of the length of one side is the 
sum of the squares of the lengths of the other two sides.) (8) A number 
cannot be both even and odd. _ (1) If z is a man then x is mortal. (j) No 
one lives forever. (k) If a triangle has two equal angles then it has two equal 
sides. 


There are always many ways of stating an idea verbally. As the previous 
exercises illustrate, the essential thing is to understand the meaning and 
then restate in precise terms. Sometimes the set language is simpler. This 
is evidently the case in (2), (3), (4), (8), and (9). There the set language 
enables us to dispense with explicit use of quantifiers. For example, it is 
easier to say that all men are mortal (men ΓΞ mortals) than to say that if 
x is a man then z is mortal. On the other hand, the set terminology is 
sometimes less convenient. This appears to be the case in (1), (5), (6), 
and (7). For example, it is easier to say that John loves Mary than to say 
that John belongs to the set of people that love Mary. In many cases, 
either terminology is used conveniently. For example, we say that 3 is 
prime or that 3 is a prime number. | 


(1) Review Section 2-10, especially (10) and (11) and Problems 28 through 
43. State the following in terms of sets. (m) All triangles are polygons. 
(n) All equiangular triangles are equilateral, and conversely. (0) No oblique 
triangles are right triangles. (p) Some isosceles triangles are equilateral. 
(q) Not all odd numbers are prime. (r) Every integer has one as a factor. 


In classical (presymbolic) logic, much reasoning was carried on in terms 
of “all,” “some,” and “none,” a kind of primitive reasoning in terms of 
sets, which was called “syllogistic reasoning.” The rules for such reason- 
ing are complicated, and all such problems can be handled more easily 
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in modern symbolic terms. The easiest method is to express all statements 
in terms of set inclusion, then use the laws of this chapter. 
The following table indicates the appropriate translations: 


(10) All P are Q, Pod. 
(11) — Not all P are Ὁ, or 

Some P are not Q, ~(P CQ). 
(12) No P are Q, or 

All P are not Q, Ρς-ῷ.. 
(18) Some P are Q, ~(P CQ’). 


In the typical syllogistic problem, we are given two premises of the forms 
(10) and (11) and asked to derive a third if this is possible. For example, 
given that all collies are dogs (C C D) and that all dogs are mortal (DCM), 
we have immediately that all collies are mortal (C C M) by (8-8-2). All 
syllogistic reasoning can be reduced to the application of (8-8-2) or its 
variants (3-8-15) and (8-8-17). For example, given that all primes 
greater than 2 are odd (P C D) and that some numbers greater than two 
are not odd (~(N C D)), it follows that some numbers greater than 2 
are not prime (~(N C P)) by (38-8-17). 

To test a syllogistic argument for validity, simply express its premises 
and conclusion in set terms and see whether it follows from the laws of 
set inclusion. For example, given that all primes greater than 2 are odd 
and that some numbers greater than 2 are not prime, does it follow that 
some numbers greater than 2 are not odd? In symbols, 


(PCD) A~(NCP)>~(N CD)? 


The reader can easily check that this is not in the form (8-8-15) or (8-8-17). 
Recalling that a valid argument must be in the form of a law, we can 
easily show that the above is invalid by the counterexample (P:{1, 2}, 
D:{1, 2, 3, 4}, N:{8, 4}). 

Of course we could handle syllogistic problems directly in terms of laws 
of logic, using quantifiers, or we could express the inclusion relations in 
various ways by using (3-8-16) or (8-8-20). In any case, by symbolizing 
such problems we avoid both the complexity and ambiguity of the tradi- 
tional logic. 


(s) “If the first premise is the proposition that all human beings are moti- 
vated by self-interest in their actions, the conclusion that all rulers tend to 
serve their interests can readily be obtained by means of a syllogism.” (“The 
Issue of a Science of Politics in Utilitarian Thought,” by F. Kort, American 
Political Science Review, December, 1952) Do this. 


9--10] SETS AND SENTENCES 171 


What conclusion can be drawn from: (t) all AisBandnoBisC,  (u) all 
Ais Bandsome AisC, (v) no A is B and some ( is B? 

(w) Some laws are complicated; no confusing laws are satisfactory; every 
complicated law is confusing. Draw all conclusions you can. (x) No A is B, 
no Bis C. Can you draw a conclusion? 


PROBLEMS 


Express Problems through 12 in terms of sets and using quantifiers. 


1. All dogs are animals. 
2. No dogs are able to talk. 
3. No one can be both a man and a woman. 
4, Not all men are good men. 
5. Some women are blond and blue-eyed, some are blond and not blue-eyed, 
and some are blue-eyed and not blond. 
6. Some men are both rich and happy. 
7. If a triangle has two equal sides, it has two equal angles. 
8. If a triangle has no equal sides, it has no equal angles. 
9. Not all primes are odd. 
10. No perfect squares are primes. (Is it true?) 
11. Some even numbers are prime. 
12. Some A is not B. 


With P, Q, R defined as for (1) through (9), continue the table of synonyms 
by inserting missing entries in 13 through 18. 


13. PCQ, ? 14. PX Q, ? 
15. Ρ ¥ Ὁ, ? 16. P # U, ? 
17. ? p(a) V q(a). 18. ? p(a) A g(a). 


*19. When two sentences cannot both be true, they are said to be contradic- 
tory. Suppose p(x) and q(x) are contradictory. Express this in terms of 
quantifiers and in terms of sets. 

20. How is the discussion of this section illustrated in Section 3-5? 


In Problems 21 through 26, draw any conclusion you can or decide on the 
validity of the reasoning. 


21. All A is B, some A is C. 

22. No Ais B, allCis A. ... NoC is B. 
23. All Ais B, noC is A. .᾿. NoC is B. 
24. No Ais B, no Bis C. 


25. “Of the prisoners who were put on their trial at the last Assizes, all, 
against whom the verdict ‘guilty’ was returned, were sentenced to imprison- 
ment; some, who were sentenced to imprisonment, were also sentenced to hard 
labor. Hence, some, against whom the verdict ‘guilty’ was returned, were 
sentenced to hard labor.” [This and Problem 26 are among the many highly 
realistic and practical logical problems composed by Lewis Carroll and pub- 
lished in his Symbolic Logic in 1897.] 
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26. “No kitten that loves fish is unteachable; no kitten without a tail will 
play with a gorilla; kittens with whiskers always love fish; no teachable kitten 
has green eyes; no kittens have tails unless they have whiskers.” Draw all 
possible conclusions. 


27. Go through this chapter, formulating the laws of logic corresponding to 
each law of set theory. 

28. Do the reverse, translating laws of Chapter 2 into set terminology. 

29. In this section we have violated our agreement about the meaning of 
“equals.” Where? 

30. Read “The Maneuvers in Set Thinking,” by W. L. Duren, in The Mathe- 
matics Teacher, May, 1958, for an interesting review of the ideas of this chapter 
and a preview of some ways in which they will be used later in the book. 


ANSWERS TO EXERCISES 


(a) 3, odd numbers; that is not divisible by 2. (b) He; He, {my father}; 
He, is my father. (ὁ) @; x is prime and x is even. (4) He € (A’N W’) 
(e:) HHE(HNWNS). (f)HeE(HNM). (ὦ R= P. (hb) Evensn 
odds = @. (i) Men C mortals. (j) F = ὦ where F = {x|z lives forever}. 
(k) ACB where A = triangles with two equal angles, 6 = triangles with 
two equal sides. 

(m) Tr € Po. (n) Eq = EL. (0) ObCR’. (p) ~(Is C Eq’). (q) ~(Od C P). 
(r) JCF, where J = integers, F = numbers having one as a factor. 
(8) [Kings C humans A humans C beings motivated by self-interest] — 
[kings C BMSI]. (Ὁ No AisC, by [ACBA BCC] > (ACC). 

(u) Some BisC, by[A CB A ~(A CC’)] — ~(B CC’) by (8-8-15). (ν) Not 
all Cis A, by [A © B’ A ~(CC B’)| -5 ~(C'C A) by (3-8-17). (w) Some 
laws are not satisfactory. We have ~(Comp G 17), (Conf C Sat’), 

(Comp € Conf). From the last two by (3-8-2), (Comp C Sat’). From this 
and the first by (3-8-15), ~(Sat’ C L’) or ~(L ΕΞ Sat). (x) No. 


ANSWERS TO PROBLEMS 


1. DCA, Vex EDnrEdA 382 MNW=Q, ~X2e2 EM AE W. 
δ. WN Blon Blu ¥ @ A WN Blon Bl’ = @ A WN Blo’ ἢ Blu = @. 
azz EC WA xcEeBloA xeEBluA rxeGCWAxEeBloA ΣΕ Bu A 
fazxEWwWA «xEBlo’ A xe Blu. 7. EsCEa. Vz [zx has two equal sides > x 
has two equal angles]. 9. ~(P C D). 11. ~(E C P’). 12. ~+(A CB). 

13. Vax [p(x) — q(z)] A ~ Va [ᾳφ(α) — ρ(α)). 15. Ja p(x), 17. aE PUQ. 
19. (4). 21. Some Bis C. 22. Valid. 23. Invalid. 24. No conclusion. 25. In- 
valid! No conclusion [see Symbolic Logic by Lewis Carroll (C. L. Dodgson) 
p. 63]. 26. No kitten with green eyes will play with a gorilla. 29. In 
Problems 7 and 8. 


CHAPTER 4 
PLANE ANALYTIC GEOMETRY 


4-1 Ordered pairs. Only rarely are we concerned with a single object 
in isolation; usually we are interested in objects related to other objects. 
In the simplest case we have two objects paired. For example, with each 
finite set, we may pair the number of its members; with each item in a 
store, we may pair its price; with each time, we may pair the velocity of a 
moving body; and so on. For definiteness, let us think of the items in a 
store and their prices. We number the item and give price in cents, 50 
that item 1 has price 5, item 2 has price 103, and so on. We could make a 
table showing this information. Each pair, consisting of an item and its 
price, makes up a set of two members, for example {1, 5}. However, sup~- 
pose that the number and price of an item were the same, for example, 
that item 6 has price 6. Then we should have for this item {6, 6} or {6}, 
which is no longer a pair. Or suppose that item 10 has price 15 and item 
15 has price 10. Then we wish to distinguish these two pairs, but 
{10, 15} = {15,10}. Accordingly, we see that the simple concept of 
a pair of numbers is not adequate in this situation. We have to be able 
to distinguish which member of the set is the item and which the price. 

A pair in which we distinguish one of the members as the first and the 
other (which need not be different) as the second is called an ordered pair. 
The ordered pair whose first member is x and whose second member is 
y 1s symbolized by (1, y) and is read “x,y” or “the ordered pair x, y.” 
The essential property of an ordered pair is given by the following axiom. 


(1) Ax. [(α, δ) = (ς, 4)] = [a =c A b= dl. 


(a) Show that [{a,b} = {c,d}] = [a Ξ 5 A ὃ = ἃ] ἰ5 ποῖ 8 law. (b) Prove 
(2). κ(ο) We could define (x, y) by (a, y) = {{x, y}, {a}} and then prove (1). 
Do this. 


(2) [(a, ὃ) = (6, a)] = [a = 9]. 


In Section 1-2 we associate single numbers with points on an axis in 
such a way that to each real number there corresponds one and only one 
point, and to a point one and only one real number. In Section 1-4 and 
later sections we make use of this one-to-one correspondence to interpret 
numbers and operations on them in terms of points and vectors. For 
example, we interpret ὃ — a as the directed distance from a to ὃ and its 
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absolute value as the undirected distance between a and b. Also, we 
think of a sentence as determining a graph, namely, the set of points cor- 
responding to the truth set of the sentence. For example, we view 
{c|a < x < b} asthe closed interval between a and ὃ. (See Section 3-4.) 


(d) Review previous sections, noting the way in which we used the corre- 
spondence between numbers and points on an axis to interpret numbers, opera- 
tions on numbers, equations, and inequalities. (6) Is the correspondence 
between sentences and sets one-to-one; i.e., is there just one defining sentence 
for each set of points and just one set of points for each defining sentence? 


The significant thing about the correspondence between single numbers 
and points or vectors on an axis is that it permits us to deal with geometric 
problems by algebra and to visualize algebraic ideas geometrically. The 
idea can easily be generalized by establishing a correspondence between 
ordered pairs of numbers and points in a plane. The interpretations of 
operations generalize naturally. As before, sentences have geometric 
interpretations, and geometric figures (sets of points) can be dealt with 
through their defining sentences. The idea of establishing a correspondence 
between real numbers and points, developed by Descartes and others in 
the early seventeenth century, is the key idea of analytic geometry. 
It brought about the reconciliation of algebra and geometry, which had 
been separated since Greek times, and led directly to the invention of the 
calculus, which has played such a significant role in modern science. 

The purposes of this chapter are (1) to show how the key idea of ana- 
lytic geometry applies to ordered pairs of real numbers and points in the 
plane, (2) to introduce the student to the algebra of plane vectors, (3) to 
familiarize the student with the graphs of certain frequently occurring 
equations and inequalities, and (4) to utilize analytic geometry to develop 
a skill in the algebra of real numbers, logic, and set theory. Additional 
topics in analytic geometry are included also in later chapters. 


PROBLEMS 
Solve the following sentences: 
1, (2, 3) = (4, 3). 2. (x, 1) = (1, 2). 
3. 27+ y? = 0. 4. (x, y) = (22, 3y). 


ANSWERS TO EXERCISES 


(a) See (3-3-19). (0) Immediate from (1). Note the use of p A p = p. 
(6) No. There is just one set of points corresponding to each sentence, but any 
two logically equivalent sentences define the same set. 


ANSWERS TO PROBLEMS 
lzrv=432=y = 0. 
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4-2 Plane coordinates. There are many ways of associating points in 
a plane with numbers. The one most commonly used is based on setting 
up two axes with common origin and with positive directions to the right 
and up. As indicated in Fig. 4-1, it is customary to call the horizontal 
axis the x-axis and the vertical axis the y-axis. The axes divide the plane 
into four regions called guadrants, numbered as indicated in the figure. 
y 
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Ficure 4-1 Figure 4-2 


Figure 4-3 


Now we imagine a line drawn parallel to the y-axis (perpendicular to 
the z-axis) through each point on the z-axis (Fig. 4-2). Similarly, we 
imagine lines parallel to the x-axis (perpendicular to the y-axis) through 
the points on the y-axis (Fig. 4-3). Each point in the plane lies on one and 
only one of the vertical lines and one and only one of the horizontal lines. 

(a) Why? 

The vertical line on which a point lies indicates the directed distance of 
the point from the y-axis. The directed distance is precisely the linear 
coordinate of the point where the vertical line crosses the z-axis, i.e., the 
number corresponding to the point on the z-axis. We call this number 
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the x-coordinate of the point. Similarly, the horizontal line indicates the 
directed distance from the z-axis. This directed distance is the linear 
coordinate of the point where the line crosses the y-axis. We call this 
number the y-coordinate of the point. Evidently, to any point there cor- 
responds a unique ordered pair of real numbers (x, y), where z and y are 
respectively the x- and y-coordinates. 

Conversely, to each ordered pair (x, y) there corresponds a unique point 
lying at the intersection of the vertical line at a directed distance x from 
the y-axis and the horizontal line at a directed distance y from the z-axis. 

In Fig. 4-4 we sketch the point corresponding to (5, —2). The directed 
distances are suggested by vectors. 


For each of the following draw and label a pair of axes; plot, circle and label 
points; and show directed distances, as in Fig. 4-4: (Ὁ) (5,2), (ὁ) (--2, 5), 
(d) (5, 2), (e) (2, ὅ), (ἢ (—5, —2). 

(g) through (n) Label the points in Fig. 4-5 so as to indicate approximately 
the ordered pair of real numbers corresponding to each one. 


FIguRE 4-4 Figure 4-5 


We can now say precisely what we mean by a solution of a sentence in- 
volving two variables. We call an ordered pair (a, δ) a solution of a sen- 
tence f(x, y) if and only if f(a, b). We then say that (a, δ) satzsfies the sen- 
tence f(x, y). The set of all solutions of a sentence f(z, y) is then a set of 
ordered pairs, which is the truth set of the sentence. We designate the 
set of all solutions of f(x, y) by {(2, y) |f(z, y)}, which is read “the set of 
ordered pairs (z, y) such that f(x, y).” This notation may be formally 
defined as follows. 


(1) Def. ἰῷ, ψ) |f(z,y)} = (2|32 dy f(x,y) A lz = (ὦ, yi}. 
Its essential property is 


(2) [(a, ὃ) & {(a, y) |f(@, y)3] fa, 6). 
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Just as we used a numeral or a single numerical variable to represent 
both a number and its corresponding point, so now we use an expression 
of the form (2, y) to stand for any ordered pair of numbers or for the cor- 
responding point. Similarly, just as we used {x]f(x)} to represent both 
a set of numbers and a set of points, so now we think of {(z, y) |f(a, y)} 
as both a set of ordered pairs and as the set of corresponding points. We 
call the set of ordered pairs of points defined by f(x, y) the plane graph 
of f(x, y). 

The plane graph of a sentence is simply the set of points whose co- 
ordinates satisfy the sentence. A point lies on the graph if and only if its 
coordinates satisfy the sentence, This criterion can be used directly to find 
the graphs of simple sentences. For example, the sentence zx = 3 A y = 4 
is satisfied only by the pair (3, 4). Hence the graph consists of a set of 
points with only one member, namely, the point (3,4). Symbolically, 
i(z,y)|e@ = 3 A y = 4) = {(3,4)}. Now {(x,y)|z = 3} is the set 
of all pairs with first component 3. The corresponding points make up 
the line parallel to the y-axis and three units to the right of it, as sketched 
in Fig. 4-6. In the same figure we have graphed {(z, y)|y = 4}. But 
iz, |e = 3 A y= 4} = {(z,y) |e = 838} ἡ [(, y) ly = 4} = {(3, 4)}, 
since the two graphs intersect in the single point already found. Note 
that we use the defining sentence of a graph as a name for it. 


Sketch the following: (0) ᾧ = —lly = 3,24 =—1lAy=83; () ὦ Ξ 
—“l1Vy=83; (Vy =z. 


The reader has observed that when one variable is missing from a de- 
fining sentence, this means simply that no limitations are placed on the 
values of that variable. For example, the plane graph of ᾧ < 3 consists 
of all those points whose 2z-coordinate is less than 3, i.e., all those points 
to the left of the line x = 3, as sketched in Fig. 4-7. 


Sketch and describe in words: (r) y>1, (s)a>0, (ὃ e2+ y? = 0. 


FIGure 4-6 FIGURE 4-7 
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It is often difficult to discover the nature of the graph when the range 
of the variables in a sentence is the entire set of real numbers f,. Even 
in the simple cases we have considered above, the graph has usually been 
of infinite extent, so that we could sketch only a part of it. However, if 
the variables are limited to a range consisting of a finite number of values, 
the graph of any sentence may be determined by testing each point to see 
whether it satisfies the sentence. Usually many points can be eliminated 
as obviously not solutions. Suppose, for example, that we limit the range 
of the variables to {0, 1, 2,3, 4}. Then 


{(a, y) |x = 3} = (ὦ, 0), (3, 1), (, 2), (3, 3), 3, 4}. 


The graph is sketched in Fig. 4-8. 


FIGURE 4-8 
Let the range of variables be {0, 1, 2,3,4}. Graph: (u) 7 =2V y = ὃ, 
vV)z>2 WwWee<2Ay>l WyS4, σ᾽ ͵ἑ 2 τ: 8λ2- 


y<3 ἡ 2 χϑϑν 2Ξνυ 3 3. 


PROBLEMS 


In which quadrant does (a, ὃ) lie if 


lLa<OAO6>0. 2,.—a<0A6>0 
3.€4a@<0A6< 0. 4,.a@>0A6<0. 
In which quadrant does (—a, ὃ) lie if 

5.a>0A6> 0. 6a<0A6< 0. 
7.a>0A—b> 0. 8. —a>O0A —b> 0. 


9, Complete the following: The members of an ordered pair are called 
{ὸ. ς of the corresponding point. 


In Problems 10 through 21 the range of the variables is {0, 1, 2, 3,4}. De- 
termine the set of ordered pairs defined by each sentence, and graph. 
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10. x > 3. ll. y < 2. 

12,0 < y < 3. 13.2 <2 A y- <2. 
14.2 = —1V y = 2. 1.2 =O0V y= 0. 
16y=2A0<2< 2. 17. ἡ Ξξ ἄς. 

18. y = 2x. 19.¢+ y = 4. 
20. y = (1/2)z. 21. ἡ = V2. 


In Problems 22 through 31 the range of the variables is the real numbers. 
Sketch the graph of each sentence, using whatever method you wish. 


22. 4 = 0. 23. y = 0. 
24. ry = 0. 25.2=1Vy< 0. 
26.2 =1Ay< 0. 2.y=2A—-l1<2< 1 
28. xy > 0. 29.z-+y = 4 
30. [1] = 2. 8l.erty=4A2>0Ay2>0 
*32. Show without reference to geometry: {(z, y)x = 3} //{(z,y)|x = 4}. 
Graph 
*33. {(x, y)|zy > 0}’. 
#34. {(x, y)|zy > 0} ((, y)||2| < 1) 
*35. {(2,y)|cy > 0} U {(z,y)||2] < 1} 


ANSWERS TO EXERCISES 


(a) Through a given point not on a line there is one and only one line parallel 
to a given line. This is Euclid’s famous parallel axiom. (b) through (f) 
Fig. 4-9. (g) (—2.5, 2). (h) (0,1.6. (ὁ (2,1). (j) (0,0). (kK) (3, 0). 
(1) (2.5, --1.2). (m) (0.5, —2).. (0) (—1.8, —1.2). (0) First is line paral- 
Jel to y-axis one unit to left; second is the parallel to z-axis three units above; 
third is point of intersection of these. (p) Figure consisting of the two lines 
graphed in Exercise (0). (q) Line bisecting first and third quadrants. (r) All 


FIGURE 4-9 
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of plane on and above liney = 1. (8) Allof plane to right of y-axis. (t) The 
origin. (u) 9 points. (v) 10 points. (w) 6 points. (x) 15 points form- 
ing triangle. (y) 4 points forming a square. (2) 16 points forming a cross. 


ANSWERS TO PROBLEMS 


1. Il. 3. IL. 5. 11. 7. III. 9. coordinates. 13. {(0, 0), (0, 1), (1, 0), C1, 1)}. 
15. 9 points on the axes. 17. {(0, 0), (1, 1), (2,4)}. 19. {(0, 4), ((, 3), (2, 2), 
(3, 1), (4, O)}. 21. {(0, 0), C1, 1), (4, 2)}. 28. z-axis. 25. Fig. 4-10. 26. Fig. 
4-11. 27. Fig. 4-12. 29. Straight line through (0, 4) and (4,0). 31. Portion 
of 29 in quadrant I. 


" 


FIGuRE 4-10 FIGuRE 4-11 


(—1, 2) (1, 2) 
y=2A-ls2e81 


xv 


FIGURE 4-12 


4-3 Plane vectors. In Section 1-4 we established a correspondence 
between single real numbers and vectors on an axis, which we used to give 
vector interpretations of alegbraic operations. We now generalize this by 
establishing a correspondence between ordered pairs of real numbers and 
vectors in a coordinate plane., 

An ordered pair of real numbers (a, ὃ) determines a vector from the 
origin to the point (a,b). As in Section 1-4, we think of the vector as 
moving about without changing its length or direction, which is suggested 
in Fig. 4-13. Wherever it is located, the vector corresponding to (a, ὃ) 
is determined by the property that if one starts at the initial point and 
goes a horizontal directed distance a and then a vertical directed dis- 
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(a, δ) (a, δ) 
b 
= (a, b) + (c,d) Ἂς Ned 
(c, d) 
ee ened 
] 
| Ϊ: 
Ee a ee . 
α Cc 
ate 
FicurE 4-13 FIGURE 4-14 


tance ὃ, one arrives at the terminal point. The axes are required only to 
determine the horizontal and vertical directions and the scale of measure- 
ment. Conversely, if we have any vector, we can draw lines through its 
initial and terminal points parallel to the axes and so determine the cor- 
responding ordered pair. Hence there is a one-to-one correspondence 
between ordered pairs and vectors. Because of this we use a name of an 
ordered pair as a name of the corresponding vector, and we think of (a, b) 
as an ordered pair, a point, or a vector, as convenient. We call a and ὃ 
the components of the vector (a,b). They are indicated graphically in 
Fig. 4-13. | 

Sketch the following vectors, showing their components as in Fig. 4-13: 
(a) (1,38); (Ὁ) (—3, 7); (0) (4,0), (0, —1); (4) (4, --1);  (f) (—83, —3). 

Our procedure for adding vectors in the plane is the same as on a line. 
To find (a, δ) + (c, 4) we place the initial point of (c, d) on the terminal 
point of (a, ὃ) and then find the vector from the initial point of (a, b) to 
the terminal point of (c, d), as in Fig. 4-14. Note from the figure that the 
components of the sum are sums of corresponding components. 


Sketch the following sums as in Fig. 4-14: (ρ) (8, 5) + (4, 2); 
(h) (3,5) + (—1, —2); (i) (4, —1) + (—2. —8). 


The above discussion suggests the following definition of the sum of 
two ordered pairs of real numbers. 


(1) Def. (a,b) + (c,d) = (a+c¢,b+4 a). 
By (6:0, c:0, d:b) we have immediately 
(2) (a,b) = (a, 0) + (0, δ). 
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Since (a, 0) is a horizontal vector, and (0, b) is a vertical one, (2) says that 
any vector is the sum of a vertical and a horizontal vector. We call (a, 0) 
and (0, δ) the component vectors of (a,b). They are simply the vectors 
labeled a and ὃ in Fig. 4-13. 

We see immediately from (1) that 


(3) (a, δ) + (0,0) = (α, ὃ), 


which corresponds to (1-16-7). Hence it appears that (0, 0) plays the 
same role in addition of vectors that 0 did in the addition of real numbers. 
We call (0, 0) the null vector. 


(j) Show that (0,0) is the only vector with the property (3); that is, that 
[(a, δ) = (x, y) = (4, b)] — (ὦ, y) = (0, 0). 


We saw in Section 1-4 that the vector —a has the same length as a but 
is opposite in direction. Figure 4-15 suggests that to reverse the direction 
of a vector is to negate its components. Accordingly, we define 


(4) Def. | —(a,b) = (—a, —b). 


We have immediately that (a, ὃ) + (—(a,b)) = (a,b) + (—a, —b) = 
(0, 0), which corresponds to (1-16-9). Hence the negative plays the same 
role here as for real numbers. 

We define subtraction by analogy with (1-9-18), which is equivalent 
to (1-14-19). 


(5) Def. (a,b) — (c,d) = (a,b) + (—(€6, d)). 
This gives immediately | 
(6) (a, b) = (c, d) π- (α — ¢,; ὃ — d). 


It is not hard to see that (a,b) — (c,d) is the vector from the point 
(c, d) to the point (a, ὃ), that is, the directed distance from (c, d) to (a, 6). 


ῳ ἐπ δ —d) 
(a, b) 


(a -- ο, ὃ -- dj) 
@ee hw 


(c, d) 


Figure 4-15 FIGuRE 4-16 
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vy 


Vy + V9 


i a 


vy 


Figure 4-17 FIGURE 4-18 


In Fig. 4-16 we see that the vector (a — c,b — d) obtained by adding 
the vectors (a, ὃ) and (—c, —d) is the same as the vector from the point 
(c, d) to the point (a, b). Hence, just as for single real numbers, we inter- 
pret a difference as a directed distance, (a,b) — (c,d) being the directed 
distance (the vector) from the point (c, d) to the point (a, b). 


Sketch the following points and the directed distance from the first to the 
second in each case: (k) (2, —5), (5,1); (1) (5,1), (2, —5); (m) (3, 5), 
(6, 10). 


Do vectors satisfy the commutative and associative laws of addition? 
We leave it to the reader to prove that they do. Figures 4-17 and 4-18 
suggest their plausibility. 


(n) Prove that (a,b) + (c,d) = (c,d) + (α, ὃ). (0) Prove that (a,b) + 
((c, d) + (e,f)) = ((α, ὃ) + (ο, d)) + (ο, ἢ). 


The above discussion shows that with our definitions, ordered pairs of 
real numbers (plane vectors) follow the axioms of algebra relating to 
addition, subtraction, and negation. Could we define multiplication of 
ordered pairs so that all the other axioms are satisfied also? The answer 
is shown to be yes in Section 6-13, but in this section we shall merely define 
the product of an ordered pair of reals by a real number. 

In Section 1-4 we interpreted the product ab as the vector found when 
the vector ὃ was laid out a times. Figure 4-19 shows a vector added to 


FiaurE 4-19 
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itself 3 times. Because of the similarity of the triangles ABC and AB’C’, 
we have AB’/AB = c/a = d/b = 3. Here the components of the product 
are obtained by multiplying the original components by 3. This suggests 
the definition. 


(7) Def. k(a,b) = (ka, kb). 


(p) Sketch (3, 4), and 2(8, 4), and (3,4)+ (8, 4. (q) Show that the 
vectors (a, δ) and k(a, b) always lie in the same or parallel lines. 

Although we have not defined the product of two vectors, we have 
defined addition, subtraction, and multiplication by a real number. 
Since addition and subtraction were shown to satisfy the same axioms as 
the real numbers, we can use laws relating to real numbers that were 
derived from these axioms. We can also use any laws we can derive from 
the definition (7) and the multiplicative properties of real numbers. 

(r) Prove that 


(8) k[(a, Ὁ) + (6, d)] = k(a, δ) + Κίς, a). 


PROBLEMS 


Simplify the expressions in Problems 1 through 6 and draw diagrams indicat- 
ing the operations and result. 


1. (2, —4) + (3,1). 2, (4,1) = (2.0); 
3. 2(5, —3). 4, (—5)(1, —1). 
5. (1/2)(4, —6). 6. (—1, 2) + (3, —5) + (—2, —2). 
7. Prove that 
(9) (a,b) = a(1, 0) + δί(0, 1). 


(This result is described by saying that every vector is a linear combination of 
horizontal and vertical unit vectors.) 

8. Let (1,0) = δι and (0,1) = e2. Express the following in the form 
ae, + bee: (a, —1), (4, 3), (—5, —1.7). 

9. Show that 


(10) 1 + (a, δ) 
(11) (—1)(a, δ) 


(a, δ). 
—(a, b). 


I 


10. To what Jaws for real numbers do (10) and (11) correspond? 

11. How would you define division of a vector by a real number? 

12. From the definition in Problem 11 show that (1-9-29)(a:(a, 6)) is a law. 

13. Similarly show that (1-9-30) through (1-9-32) hold for vectors. 

14. Prove that (1-9-33) through (1-9-37) hold if we let the range of the 
first variable be the real numbers and let the range of the second be the set of 
all ordered pairs of real numbers. 
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15. Show that (2-8-3) and (2~8-4)(0:(0, 0)) hold for plane rectors. 


In Problems 16 through 21, solve for the unknown ordered pair. 


16. 2(5, y) = (8, 5). 

17. —3(z, y) = 2(x, y) + (1, —8). 

18. x(x, y) — (2,4) = (8, —1) — ἴα, y). 
20. ΩΣ — (—3, —2) = (4, —10). 

21. 52 = z+ (2, —1). 


ANSWERS TO EXERCISES 


(a) through (f) See Fig. 4-20. (g) (7, 7). (h) (2, 3). (i) (2, —4). 
(1) (a, δ) - (ὦ, y) = (a, 6) > (a+ 2,b6+y) = (a, δ) a+r =aAb+y Ξ ὃ. 
(k) (3,6). (1) (—3, —6). (ὦ) (3,5). (ἃ) (a, 6)+ (ο, d) = (« - ς, ὃ -ἰ- 4) = 
(c-+ a,d+ δ) = (c,d) + (α, ὃ). (0) Similarly. 


(4, 0) 
(0, =) 
(1,3)/ [8 ; 
ὡς το [-: 
(4, —1) 
1 Zs 
Figure 4-20 


(p) 2(8, 4) = (8, 4) -+ (8, 4). (q) The triangles formed by the vectors and 
their component vectors are similar since they are right triangles whose legs 
are proportional. Since their legs are respectively parallel (or in the same line), 
so are their hypotenuses. 

(r) Left member = k(a+ c,b-+ d) = (ka + ke, kb + kd) = (ka, kb) + 
(ke, kd) = right member. 


ANSWERS TO PROBLEMS 


1. (5, —3). 3. (10, —6). 5. (2, —3). 7. Simplify right member. 9. From 
definition (7). 11. (a,b)/k = (1/k)(a, δ). 13. Immediate from definition. 15. 
The proofs would be unchanged. 17. (—1/5, 3/5). 19. (8.5, 5). 21. (1/2, —1/4). 
Note that there is no objection to using a single variable to stand for an ordered 
pair! | 
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4-4 Distance in plane geometry. On an axis the distance between two 
points a and 6 is the length of the vector, 6 — a, from one to the other. 
Similarly, in the plane the distance between two points (a, ὃ) and (ce, d) 
is the length of a vector joining them. Such a vector is the difference of 
the vectors (a,b) and (c,d), that is, (c,d) — (a,b) or (ce — a,d — ὃ). 
But how shall we define the length of a vector? Figure 4-13 and the 
Pythagorean theorem suggest the answer: since the length of a vector is the 
length of the hypotenuse of a right triangle whose other two sides have 
lengths equal to the absolute values of the components, the length of a 
vector is the principal square root of the sum of the squares of its compo- 
nents. Representing the length of the vector (a, b) by |(a, b)], we have 
in Fig. 4-13, |(a, b)|? = jal? + |b|? = αὐ + b?. This suggests 


(1) Def. |(a, b)| = Va? + b?. 


(a) Why did we say that the lengths of the sides in Fig. 4-13 are the absolute 
values of aandb? (Ὁ) Why do these absolute value symbols not appear in (1)? 
(c) Find the length of the vector (—8, 5). 


There is no reason why we should not use variables whose ranges are 
sets of ordered pairs, points, or vectors. We shall adopt the convention of 
using boldface type for this purpose. Thus ἃ is a variable for which we 
may substitute other boldface variables, formulas whose values are 
vectors, or expressions of the form (a,b). Then [ὰ] is the length of the 
vector u. We let 


(2) Def. — (0,0). 

Then 

(3) jul > 0, 

(4) (jul = 0) «» (ἃ = 0), i 
(5) [κα = [1] - ful, Fiaure 4-21 
(6) ju + v| < ful + [δ]. 


The geometric interpretations are as follows: (3) The length of a vector 
is non-negative. (4) The length of a vector is zero if and only if the vector 
if the null vector. (5) The length of a multiple of a vector is the absolute 
value of the multiple times the length. (6) If the vectors u and v do not 
lie in the same straight line, this becomes the famous “triangle inequality” 
of Euclid: the length of any side of a triangle is less than the sum of the 
lengths of the other two sides. (See Fig. 4-21.) 

These laws are “evident” geometrically, but they can be proved al- 
gebraically without any reference to geometry, and then the results 
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interpreted as proofs of the corresponding geometric laws! The proofs 
of the first three are easy. 

Prove: (d) (8), (e) (4, (ἢ) (5). 

To prove (6) we let u = (a, 6) and v = (c,d). Then 


u+v= (a+cb+ ἃ), 


and (6) becomes 

(6) Va +e)? + (b+)? < Va? +0? + Ve? + a? 

But (6’) is equivalent to 

(7) (a+c)? + (b+)? < a? +b? +4 6? 4 d? + 2V(a? + b%)(c? + a?) 


by applying (3-5-43), which is legitimate since the left member of (6’) 
is non-negative, and using (1-12-9) and (1-13-22). Continuing, (7) is 
equivalent to 


(8) a” + 2ac + c? + b? + 2bd + a? | 
< α + b? + ce? + d? + 2V (a? + b7)(c? + d?) 


Now subtracting common terms from both sides and dividing by 2, we 
have the equivalent inequality, 


(9) 66 - bd < V (a? + b?)(c? + d?). 


Now if ac + bd < 0, (9) is certainly a law, and the theorem is proved. 
If ac + bd = 0, then we can apply (3-5-43) again to get the equivalent 
inequality, 


(10) a*c? + 2abed -+ 0343 < a2c? + a2d? + b%e? + bd? 
Simplifying and subtracting 2abcd, we find 


(11) 0 < a*d? — 2abed + b7c?, 

(12) 0 < (ad — be)?. 

The last sentence is certainly a law. Since we have proved that (6’) is 
logically equivalent to it, the proof is complete. (Compare Section 2-8.) 


(g) Indicate the substitutions in (3-5-43) to get the logical equivalence of 
(6’) and (7). (h) How are (1-12-9) and (1-13-22) used? (i) Indicate the 
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laws and substitutions used to get from (7) to (8), (7) from (8) to (9). 
(k) Justify in detail (9) «9 (10). () Why is (11) <> (12)? (m) Prove (13). 


(13) [—u| = |u| 


We can now easily derive a formula for the distance between two points 
(a,b) and (c,d). This distance is the length of either of the vectors join- 
ing the two points, and these vectors are (c,d) — (a, δ) and its negative 
(a,b) — (c,d). By (18) it makes no difference which we take. Hence the 
distance is given by the length of (ες — a, d — ὃ), that is, by 


(14) i(c, d) — (a, b)| = V(e — a)* + (ὦ — δ)5. 


Thus we get the distance between two points by subtracting their co- 
ordinates, squaring, adding, and taking the principal square root. 


Find the directed distance from the first to the second point and the distance 
between them: (n) (3, —5), (7,8); (0) (V2, —8), (1+ V2, V5 — 8). 


In plane geometry a locus is described as a set of points satisfying some 
condition. In analytic geometry, we often have the problem of finding a 
sentence defining a certain locus, i.e., such that the locus is the graph of 
the sentence. For example, a circle is defined as a locus of points equidis- 
tant from a fixed point. Suppose the center is at (h, k) and the radius is r. 
Then a point (2, y) lies on the circle if and only if the distance between 
(x,y) and (h,k) is r, that is, if and only if [|(Σ — h,y — k)| = 7, or, 
since both members are non-negative |(x — h, y — k)|? = r?. Hence a 
defining equation of the circle with center (h, k) and radius r is 


(15) (2 — h)? + (y — hk)? = 7°, 


A point lies on the circle if and only if its coordinates satisfy equation (15). 


Write the equations of a circle with given centers and radii, and sketch them 
in the coordinate plane: (p) (3, 4),1; (q) (0,0),1; (ἡ) (4, —8), 2. 


A standard problem of plane geometry is to bisect a segment. Let us 
see how we can do this with vectors. Figure 4-22 shows the segment 
joining (71, ¥1) and (xo, yz) with the midpoint (%, 7), and also shows the 
vectors from the origin to the points, where uy = (1, ¥1), U2 = (25, ¥2), 
and 1 = (%, 7). Note that we think of the same symbols as representing 
points or vectors, as convenient. Now uw is the midpoint if and only if 


(16) Ug — Uy = 2( — wy), 


which follows from our discussion of the meaning of multiplication of a 
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(X41, 1) (Z, 9) (Xa) Y2) 


FIGURE 4-22 


vector by a real number [see Fig. 4-19 and (4-3-7)]. We can solve (16) 
for u by the usual rules of algebra. We find 


(17) u = (1/2)(u, + wp). 


Note that the midpoint is simply the average of the endpoints. To get 
formulas for the coordinates, we substitute the coordinate forms for the 
variables in (17) to get 


(17) (2, 7) = (1/2)[(a, y) + (2, y)] 

(177) = [(1/2)(@1 + 22), (1/2)(y1 + ψ2)}. 
From this we have, as expected, 

(18) = (σι + %2)/2, Y= (yi + ψ2)22. 


Find the midpoints of: (8) (8, —4), (2,16); (ὃ) (—1, —1), (—17, —3). 
(u) Carry through the algebra to get (17) from (16). (v) Derive (17) from 
the observation that we must have ue — ἃ = ἃ — uy. 


PROBLEMS 
In 1 through 4 find the distance from the origin to the point. 
1. (3, —4). 2. (1,0). 3. (0, —A4). 4, (—I1, —7). 
o. Under what conditions will the equality in (6) hold? 


In 6 through 9 sketch the two points, the directed distance (vector) from the 
first to the second, and its component vectors. Find the distance between them. 
6. (4, —10), (0, 1). 7. (350, 0), (352, 4). 
8. (1.6, —3.7), (2, —3.3). 9. (V2, V5), (νῷ, —V5 
Sketch the circles in 10 through 13. 
10. («@ — 1)?+ y? = 18. 1, 4¢ +2)? = G4)? &-4, 
12. («@ — 2)? + y? = 4. 13. 27+ (y — 3)? = 9, 
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Sketch the graphs in 14 through 17. 


14. The union of the truth sets of Problems 10 and 11. 

15. The complement of the graph of Problem 12. 

16. The intersection of the graphs of Problems 12 and 13. 

17. The intersection of the graph of Problem 10 with {(z, y)|z < 0}. 


18. Show that (« — h)? + (y — k)? < r? defines the region interior to and 
on the circle. 


Sketch the graphs of the sentences in 19 through 22. 


19. 2 +y?<1A2e>Qd. 20. ~(x? + y? < 1). 
21. 22+ y2 =9V 22+ (y— 1)? < 16. 
W22,e72° + y7° FON Y <2. 


23. Show that if two points lie on the z-axis, then (14) gives the distance 
between them as the absolute value of the difference of their coordinates. 
Generalize. 

24. In what sense can it be said that the length of a vector behaves “the same” 
as the absolute value of a real number? | 
*25. If a point ἃ lies 2 of the way from one point u; to another ug, then 
u — u; = (%)(ug — αι). Use this idea to find the point 3 of the way from 
(3, 5) to (8, 2). Sketch. 

*% 26. Find the point 3 of the way from (1, —4) to (2, —8). Sketch. 

* 27. Derive a formula for the coordinates of a point P lying on the line de- 
termined by the points P; and Pe in such a position that the directed distance 
from P, to P is r times the directed distance from P to Pe. 

* 28. What is r for the bisector? for Problem 25? 

*29. Where is Pifr<1? r>1? r<0? -—-1l<r<0? r<—l? 
Illustrate with examples. 

* 30. Let capital letters be variables standing for points and PQ = the distance 
between P and Q. Prove that | 


(19) PQ = 9, 

(20) (PQ = 0) (P = 9), 
(21) PQ = QP, 

(22) PQ < PR+ RQ. 


ANSWERS TO EXERCISES 


(a) Lengths are never negative, but α and ὃ could be. (Ὁ) (—a)? = a?. 
(c) 1/34. (4) Principal square root is never negative. (6) Squares are non- 
negative, so sum of squares is zero if and only if each is zero. (f) Put (ka, kb) 
for (a,b) in (1). (6) The left and right members of (6’) are substituted re- 
spectively for a and b. (8) To evaluate square of right member of (6’). 
(i) (1-12-9). (j) (3-5-15). (k) (3-5-43), (1-18-19). (1) (1-12-10). 
(m) (1)(a:—a, b:—b). 
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(n) V185. (0) V6. (p) ὦ -- 3)? + y¥—42 = 1. (q) 22+ y2 = 1. 
(r) (ὦ — 4)? + Y+ 38)? = 4. 6) (2.8,6. (ἢ (9, —2). (ὦ we — 
Uy = 2u — 2uj;u2g+ αὶ = 2. (v) Add w+ αἱ to both members. 


ANSWERS TO PROBLEMS 


1. 5. 8. 4. 5. When the vectors have the same direction. 7. 20/5. 9. 20/5. 
11, Center at (—2,4), radius 1. 13. Center at (0,3), radius 3. 15. All the 
plane but the circle. 17. The portion of the circle (an arc) lying to the left 
of and on the y-axis. 19. The part of the interior and circumference of the 
circle to the right of the y-axis. 21. Both loci, a circle and a circle together 
with its interior. 23. Here d = ὃ = 0 and we use (1-13-20). 25. (19/3, 3). 
27. (ui + rue)/(1+ γ). 29. Left of midpoint; right of midpoint; outside seg- 
ment; to left of segment; to right of segment. 


4-5 The straight line. To study the straight line by analytic methods, 
we must find defining sentences for straight lines. For this purpose we 
need to specify what we mean by a straight line. We recall that if u is a 
vector, then ku is a vector in the same line With u and of length |k} ful, 
in the same direction as u if k > 0 and in the opposite direction if k < 0. 
Figure 4-19 sketched the case k = 3. Figure 4-23 shows cases k = —3 
and k = 0.5. If we accept the theorems of plane geometry, we can prove 
that a point (x, y) lies on the straight line through (#1, y1) and (29, y¥¢) 
if and only if the vector (2, y) — (21, y,) is some multiple of the vector 
(22, Y2) — (αι, ¥1), as sketched in Fig. 4-24. Instead, we define the 


0.5a 


Figure 4—23 
(rv, y) 


(U2, ya) 


(ει, ¥p) 


is = 1ys =) 
(ie αν} 7 


Figure 4-24 
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straight line through two points to be the set of all points obtained in this 
way; that is, 

The straight line through (x1, y1) and (2, Y2) 1s by defination 
(1) {(a, y) |ak (2 — 241, y — yi) = Kee — τι, Y2 — yi)}. 
We will show that straight lines so defined do have the properties we 


expect. 
First we work with the defining sentence 


Ik (x — χα, — Y1) = κίας — 41, y2 — Y1)- 


(2) @— 3k [x — xy = kre — 21)) A [Ly — νι = Kye — ψι)}, 


(3) i es, oe 
v2 — 1 Y2— Yl 
(4) eoy-y= aes (2 — αι) (two-point form). 
2— 1 


The equivalent sentence (2) results simply from using the definition of 
multiplication and (4-1-1). The next step results from dividing both 
members of the first term in the conjunction by xz — 2x; and the second 
by ya — y1. To see the equivalence of (4), we note that if (3) is true then 
the two fractions are equal, from which (4) follows. Conversely, if (4) 
is true the two fractions are equal, and so are equal to some k! The argu- 
ment leading to (3) fails if x2 = 21 or yz = yi, that is, if the two points 
are in the same vertical or horizontal line. We ignore these cases for the 
moment. 
The equations in (2), 


(5) ὦ -- τι -- Κίς -- 14}, y= ψι t+ hye — yi) (parametric form), 


define the straight line in the sense that we get the points on the line by 
assigning values to k. The variable k is called a parameter to distinguish 
it from the variables x and y whose values are coordinates of points on the 
line. The equations in (5) are called parametric equations of the line. 


(a) Sketch the points (1,3) and (2,7) and write the parametric equations 
of the line through them; find the points corresponding to k = 0, 1,3, — 1 and 
graph them. (Ὁ) Do the same for (0, 0) and (—1, —5). 

Equation (4) is called the two-point form of the equation of the straight 
line. It is a single defining equation which is a necessary and sufficient 
condition for a point to lie on the line through (x1, y1) and (22, Y2). 

(c) Write the two-point form for the line in Exercise (a). Find points for 


which z = 1,2,4,0 and plot them. (4) Do the same for the line in Exer- 
cise (b). 
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Suppose that the two points are in the same vertical line. Then 21 = 22 
and (2) becomes Jka — σι =O Ay — y; = k(y2 — γι). Now for 
any Ψ, Yi, and y2 there exists a k satisfying the second equation (remember, 
since we assume that the points are different, we cannot have yg = yj as 
well as 22 = 2 ), so that the sentence is equivalent to x = x, (see 2-7-30). 
Hence, as we expect, the line is parallel to the y-axis and has the equation 
x = 2. Similarly, if yy; = ye, the equation of the line is y = ψι (com- 
pare Section 4-2). 

We can now find parametric equations or a single equation defining any 
line. But suppose we are given an equation. How can we tell whether it 
defines a straight line? For example, does 3x + 2y = 5, or y = 2”, or 
y = |x|? The answer to this question is given by (6). 


Any straight line has a defining equation of the form Ax + 
(6) By + C = 0 where ~(A = B = 0), and any equation of this 
form ts a defining equation of some straight line. 


The first part of the theorem is immediate, since (4) is equivalent to 
(Yo — yr)@ + (αι — t2)y + (to — 21)y1 — (Y2 — y1)X1 = 0, which is 
of the desired form, and so are both x — 1 = Oand y — y, = 0. 

To prove the second part, we suppose first that B = 0. Then A = 0, 


and the equation becomes x = —C/A, which we know is the equation 
of a straight line. Now, if B ¥ 0, the equation is equivalent to y = 
mx + ὃ, where m = —A/B, ὃ = —C/B. We find by substitution that 


(0, δ) and (—b/m, 0) satisfy this equation and hence lie on the graph of 
Az + By+C= 0. Now, by (4)(x1:0, y1:b, x2:—b/m, y2:0), the straight 
line through (0,b) and (—b/m,0) is y -- ὃ = [—b/(—b/m)](x — 0), 
which is logically equivalent to y — ὃ = mx or y = mz-+b. Since 
our original equation is equivalent to an equation that defines a straight 
line, it is a defining equation of a straight line, as we wished to prove. 

An equation of the form Az + By + C = 0 is called a linear equation, 
for obvious reasons. With theorem (6) we can easily graph such an equa- 
tion. We simply locate two points by substituting for one variable and 
solving for the other, then draw the corresponding line. The easiest 
points to find are those with one zero coordinate. For example, to 
graph 2x — 5y = 15 we substitute (x:0) to get —5y = 15 or y = —3. 
Hence (0, —3) is one point. From (y:0) we find (7.5, 0) as a second point. 
The line is sketched in Fig. 4-25. 


Find three points on the graphs of the following and sketch the line for: 
(6) 4a — y = 2, and (f) —38x-+ 2y —7 = 0. 

(g) Choose two points on the graph of Exercise (e) and use them to write 
parametric equations of the line. 
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ὕ 


(7.5, 0) 


22x—dy =15 
(0, —3) 
FIGURE 4-25 
Slope. The proof of (6) showed that any nonvertical line has an equa- 
tion of the form y = mz -+ ὃ. Consider two points on the graph of this 


equation, say (%1, ¥1) and (5, y2). Then 


(7) yy = may +b A yo = πιὰ + ὃ 


(8) > yy, — mt, = OA Yo — Mtg = ὃ 
(9) > Y1 — M2, = Y2 — M2 

(10) > m(x2 — 11) = yo- Ψι 

(11) - m = (Y2 — Y1)/(®%2 — 11). 


(h) Justify steps (8) through (11). 


The above argument shows that if we choose any two points on the 
eraph of y = mx + ὃ, the ratio of the difference of the y-coordinates to 
the difference of the corresponding z-coordinates is the same, namely m. 
This is what we would expect, since this ratio is precisely the ratio of the 
components of the vector joining the two points. As suggested in Fig. 4-26, 
all non-null vectors lying in the same straight line have components in the 
same ratio. (Note that when the direction of the vector is reversed, both 
components change sign.) 

It follows that associated with every nonvertical line there is a real 
number m given by (11). We call this number the slope of the line. 1} 


» (2 


FIGURE 4-26 
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we know the slope of a line, we know its orientation though not its exact 
position. 

Consider two points whose xz-coordinates differ by 1, that is, x2 — 
σι = 1. Then from (11), ye — ψι = m. This means that if we move 
along the line with slope m so that x increases by 1, then y will change 
by m. For example, if m = 2, y changes by 2 when x changes by 1. If 
m = —3, y changes by —3 when x changes by 1. For this reason the 
slope is sometimes called the change in y per unit change in x. If m > 0, 
the line slopes upward from left to right; if m = 0, it is horizontal; if 
m < 0, it slopes downward from left to right. These cases are sketched 
in Fig. 4-27. 

Find the slopes of the lines through: (i) (2,5), (4,6); (0) (—8, 2), 
(4,—5); (k) (2a, 1), (8a, δ). 

(1) What is the slope of the graph of y — 82 = 0? = (m) of 2x+ ὃν — 6 = 0? 
(Suggestion: Put in form y = mz-+ δ) (0) Show that the slope of a line is 
equal to the ratio of the second to the first components of any vector lying in it. 


An intersection of a locus with an axis is called an intercept, an x-intercept 
or a y-intercept. Since (0, b) satisfies y = mx + ὃ, we have the following 
result. 

The graph of y = mx + bis the straight line with y-intercept 


(12) (0, ὃ) and slope m. 


Theorem (12) can be used to graph quickly any equation of this form. 
We simply locate (0, b), then find a second point from the fact that for 
every unit motion to the right we must move m units vertically. For 
example, to graph y = —2z - 6, we plot (0, 6), then go 1 unit to the 
right and —2 units vertically to get the point (1, 4). The line is sketched 
in Fig. 4-28. The equation in (12) is called the slope-intercept form. 


FIGURE 4—27 FIGURE 4-28 
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Find slopes and intercepts, then sketch: (0) y = —5z, and (p) 142 -ἰ- 
2y —7 = 0. (q) Show that 


(13) y — νι = me — 21) (point-slope form) 


is an equation of the line with slope m passing through (71, yi). Find the equa- 
tion of the lines with slope and point as indicated for: (1) 2, (5,—1); (8) 0, 
(0, 0); (t) 0, (5, 4); (u) —7, =] —3). 

(v) Sketch lines with slopes 0, 10, —10, 1/2, 1/10, —1/10. (w) What 
kind of line has no slope? 


PROBLEMS 


Graph 1 through 6 by finding the intercepts. In each case find several 
other points on the graph and verify that they appear to lie on the straight line 
joining the intercepts. Calculate the slope. 


1. 8. — ἐν = 12. 2, 38a -[- 4y = 12. 
ὃ. στ y = 10. 4.2 — by = 18. 
5. 2n — 10 = 5y. 6. ἡ = ϑὺ ΤΊ. 


7. Suppose that an individual is going to spend $100 on two commodities 
whose prices are $25 and $10 per unit. Letting x and y be the number of units 
of each that he buys, we have 252 + 10y = 100. Graph the points representing 
his possible purchases, assuming that he can buy only whole units. 

8. Graph the possibilities of Problem 7 on the assumption that the individual 
can buy any real number of units. 


Show that each of 9 through 11 is the equation of a straight line by showing 
that it is logically equivalent to an equation of the form Az -ἰ By + C with 
~(A = B = 0), and graph. 


9 y = &. 10. x = 41. Ly = =, 


Graph 12 through 14 by using the intercept and slope. 
12. y = 105. - 1. 13. y = —z. 14. y 


(—3/10)a + 5. 


Find the slopes and the defining equations of the lines through the points 
in 15 through 17. 

15. (—3, —4), (2, —1). 16. (2, 5), (5, V2). 

17. (a, δ), (6, a). 18. (a, δ), (—a, δ). 


19. Graph the locus {(z, y)|3é2 = 2+ 3tA y = 3+ 1). 

20. Prove that the parametric equations « = xo + Atvand y = yo+ Bt, 
where ὁ is the parameter, define a straight line through (xo, yo) with slope B/A. 

21. Graph {(z, ψ)[3 2 = 1+ 3t A y = —1-+ 2b}. 

22. Graph the parametric equations x = —3 — δέ, ψ = 1-+ 4. 

23. When does a line have an x-intercept but no y-intercept? a y-intercept 
but no z-intercept? an infinite number of intercepts? z-intercept and y-intercept 
coincident? 

24, Show that when A ~ 0 and B ¥ 0 the intercepts of Ax + By+C 
= Ὁ are (0, —C/B) and (—C/A, 0). 
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*25. Show that z/a-+ y/b = 1 is an equation of a straight line with inter- 
cepts (0, 6) and (a, 0). (This is called the intercept form.) 
*26. Show that the intercepts of y = γὼ τ΄ ὃ are (0,b) and (—b/m, 0). 
What assumption do we make here? 

27. Explain why finding the x-intercept of the graph of y = mz -—- ὃ is the 
same as solving the equation mz -++ ὃ = 0. 

28. Solve 7x — 10 = 0 graphically by considering y = 7x — 10. 


Linear inequalities 


29. Show that the graph of y « mz-+ ὃ is the set of all points below the 
graph of y = ma-+ ὃ. 

In 30 through 33, graph the sentence. 

30. y < 2. — 1. 31. y > —z-+ 3. 

82. στ 2y «1. 88. 382 — y = 5. 

34. Graph the possibilities in Problem 7 if the individual can spend any 
amount less than or equal to $100. 


In 35 through 40 let A, B, and C be the graphs of the sentences in Problems 
30, 31, and 82, respectively. Graph. 


35. A’. 36. B’. 

387. ANC. 388. AUC. 

39. AN BNC. 40. ATUBUC. 
* Segments 


To write a sentence defining a straight line segment, 1.6., a portion of a straight 
line between two points, we need merely write the sentence for the entire straight 
line joining the points and, in conjunction with it, a sentence requiring one of 
the variables to lie between the values at the two endpoints. For example, 
y = 2. - Ἰ ΔΛ —2 < x < 2 defines the segment joining (—2, —3) and (2, 5) 


as sketched in Fig. 4-29. ᾿ 


(2, 5) 


(—2, —3) 


Figurr 4-29 
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Graph the sentences in 41 through 44. 


wK4o yy H=att+lAd0<2z<3. 

*42. 22 — ὃν =9A1L<2< 5. 

κά43. HOS t<2ArH14+38IAy H= 2-8. 
w44.y = δι +3 A1<y< 2. 


*45. Show that {(z, ψ)}} 30 <t< 1A (ὦ, y) = (a1, y1) + Hye — y, x2 — 21)} 
is the closed segment joining (21, y1) and (22, y2). 

* 46. Write parametric equations of the segments joining the points in Problems 
15 through 18. 

* 47. By means of tests students are given scores in linguistic aptitude (x) and 
mathematical aptitude (y), where scores may vary from 0 to 800 continuously. 
if we consider general intellectual ability to be measured by the sum of these 
scores, sketch and describe the locus of points corresponding to a total score of 
1100. Do the same if we consider 2x-++ y as a measure of ability. (The Language 
of Social Research, by P. F. Lazarsfeld and M. Rosenberg, p. 47) 


Graph 


#48. |x| + [ν] = 
*49. [1] + |y| re 
αὔθ. [a+ yl «1. 


ANSWERS TO EXERCISES 


(a) e=1-+h,y=3+4k. (0) te = —hk,y = —5k. (ce) y—38 = 4{(2 ---ΤἸ). 
(d) y = 52. (e) (0, —2), (1/2, 0). (ἢ (0, 3.5), (7/3, 0). (6) The para- 
metric equations are not unique. (h) (8) subtracting mx; and mz2; (9) a = 
bAb=c—>a=ce; (10) adding γα — γι and factoring; (11) dividing by 
2 — x, (i) 1/2. G) —l. (1) (ὃ —1)/a. () 3. (m) —273. (n) (11). 

(ο) —5, (0,0). (p) —7, (0, 3.5), (0.5,0). (4) It is equivalent to y = 
ma -ἰ- (yy — mx), and (21, y1) satisfies it. (Ὁ) y+1 = 2(2—5). (8) y Ξ 0. 
(thy =4 (u) y+3 = —7(z2+ 1). (vy) See Fig. 4-30. (w) A vertical 
line. Note that having no slope is quite different from having a slope of zero. 


0 


I ἢ 


Figure 4-30 


ANSWERS TO PROBLEMS 


1. 3/4, (0, —3), (4, 0). 3. —1, (0, 10), (10, 0). 5. 2/5, (0, —2), (δ, 0). 
7. {(0, 10), (2, 5), (4,0)}. 8. Segment joining (0, 10) and (10, 0), since x > 0, 
yo 0. 9 1l-at+(—-ly+0=0. 11.0-¢4+ 1-y+7=0. 13. Through 
origin. 15. 3/5. 17. —1. 19. Straight line with slope 1/3 through (2, 3). 
21. Straight line through (1, —1) with slope 2/3. 23. Parallel to y-axis; parallel 
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to z-axis; coincident with an axis; through origin. 24. (6); and substitute co- 
ordinates to show it passes through the points. 27. x-intercept is found by (y: 0). 
29. If y = mz-+ ὃ, (α, y) is on the line. If y is less, (ὦ, y) lies below. 31. Above 
the line. 33. Below and on the line. 35. Above and on the line. 37. A wedge 
widening to the right and down. 39. Interior of a triangular region and 
part of its boundary. 41. [(0,1) (3,4)]. 48. [(4,2) (7,0)]. 45. The end- 
points are given by? = Oandt = 


4—6 Simultaneous linear equations. A great many problems in science 
lead to finding the intersection of two straight lines. Suppose, for ex- 
ample, that p is the price and g the amount of a commodity. The amount 
that consumers are willing to buy (the demand) may be related to the 
price by an equation such as gq = —0.5p + 10. (See Fig. 4-31.) The 
amount that sellers will be willing to offer for sale (the supply) may be 
given by an equation such as g = p — 5. The actual price and amount 
sold (both offered and demanded) must satisfy both equations, 1.6., must 
satisfy ᾳ = —0.5p +10 A q=p-— 5. 

The coordinates of the intersection may be found exactly by manipu- 
lating the two equations together. Equating the two expressions for q 
gives p — 5 = —0.5p + 10, from which p= 10. Substituting this 
value of p in either equation and solving for gq, we find g = 5. Hence the 
solution is (10, 5) 

q 


q = —0.5p + 10 


Figure 4-31 


(a) Solve the problem if supply is given by ᾳ = 2p — 7 and demand by 
g = 11—p._ (b) Solve for the couple (2, y) that satisfies both 2x — y = 4 
and « -ἰ 2y+3 = 0. (c) Find and graph {(z,y)|2+ y = 25 A y = 31). 


When we wish to find ordered pairs that satisfy both of two equations, 
we call the equations simultaneous. The same term is used when two or 
more equations, inequalities, or other sentences are involved. To solve 
several simultaneous sentences is to find the set of solutions that satisfies 
their conjunction, 1.e., all of them at the same time. It amounts to finding 
the intersection of their truth sets, i.e., the set of points common to the 
sets defined by the individual cence 
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Since the graphs of linear equations are straight lines, the solution of a 
pair of simultaneous linear equations is a single point (ordered pair) if 
the corresponding lines meet in one point. If the corresponding lines are 
parallel, there is, of course, no solution. It may also happen that the two 
linear equations define the same line. In that case any point on this line 
is a solution. 

We dispose of this last case first. Suppose the two equations are Ax + 
By + C = Oand A’x + Bly + C’ = 0. Assuming first that none of the 
coefficients is zero, the imtercepts are (0, —C/A), (—C/B,0) and 
(0, —C’/A’), (—C’/B’, 0), respectively. Clearly the lines are identical if 
and only if C/A = C’/A’ and C/B = C’/B’, or A/A’ = B/B’ = C/C’. 
We describe this by saying that the coefficients are proportional. If any 
of the coefficients are zero, some of these expressions may be undefined. 
However, the condition may be restated as follows: there exists a k such 
that A = kA’, B = kB’, andC = κα΄, We leave it to the reader to show 
that this condition is also necessary and sufficient in the cases where some 
of the coefficients are zero. To summarize, 


(4) f(@, y)|Az+ By +C = 0} = (ὦ, y) [Ae + By +C’ = 03] 
oak A= kA’ \ B= kB’ AC = kC’ 


(d) Write several equations equivalent to 832 — 5y+ 1 = 0. (e) Find 
the intercepts of 2x — 3y-+1=0 and of 4: —6y+2= 0. (ἢ Show 
that C/A = C’/A’ A C/B = C'/B' «» A/A’ = B/B’ = C/O akA = 
kA’ AN B= kB’ AC = KC’. 

When a single solution exists, simultaneous linear equations can always 
be solved by manipulations such as those described in Section 2-8, except 
that now we must carry along two equations instead of just one. We 
may operate on either equation alone by replacing any term by a synonym 
(see manipulation Type A in Section 2-8) or by multiplication or addition 
as indicated by (2-8-4) and (2-8-3). In addition to these manipulations, 
we may add corresponding members according to the law 


(2) a=bAc=aeo(ate=b+dAc= a). 


(g) Illustrate (2) and prove it. (8) Show that (2) would be false ifc = ἃ 
were omitted from the right member. 


We could have solved the equations in the first paragraph of this sec- 
tion by applying (2) with (a:¢, b:—0.5p + 10, c:—gq, d:—(p — 5)), after 
first multiplying both members of g = p — 5 by —1. This amounts to 
subtracting the corresponding members of one equation from the other. 
Indeed, (2)(c:—c, d:—d) yields 


(3) a=bAc=ae(a—c=b—dAc= ὦ. 
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The standard method for solving linear equations is to apply (3) in 
order to eliminate one of the variables, after multiplying by appropriate 
constants so that the coefficients of one of the variables are the same in the 
equations. For example, to solve 3x — 5y= 15 A 2x+y= 5 we 
multiply both members of the first equation by 2 and of the second by 
3 to get 6x — 10y = 30 A θὰ - 3y = 15. Then using (3), we have 
—lby = 15 A 2x +y = 5. From this we find y = —15/13 A 2x + 
y = 5. Now we may find z by substituting (y:—15/13) in 2x + y = 5, 
or by eliminating y from the original equations. This could be done, for 
example, by multiplying the second equation by 5, then adding to the 
first by (2). This gives 3x — ὅν = 15 A 10x + ὅν = 25, 132 = 40 A 
2x -+y = 5,andz = 40/138 A 22+ y = 5. 


(i) Check by substitution that (40/13, —15/138) satisfies both the original 
equations. 


We have arranged these manipulations below in a systematic manner 
and also modified the details slightly. 


3x — by = 15 33 —5 15 

(4) 
Qe t+y= 5 2 4 5 
τ ὃς -- 10y= 8ἢ ( 6 —10 30 
6x + ὃν = 15 6 3 15 
—13y = 15 0 —13 15 

(6) 
22: +t+y= 5 Zz 1 ὃ 
τ —13y = 15 ( 0 —13 15 
26a + 13y = 75 26 18 75 
—13y = 15 0 —~13 15 

(8) 
26x -- 80 26 0 80 
a y = —15/13 ( 0 1 —15/13 
x — 40/13 1 0 40/13 


(j) Explain each transition from (4) to (9). 


Each pair of equations is logically equivalent to the preceding one in 
(4) through (9), so that the last pair gives the solution. However, the 
reader must have noticed that the x, y, =, and + play only a minor role. 
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It is the coefficients that are manipulated. Indeed, the variables and 
equals sign serve only to indicate the proper location of the coefficients. 
This suggests the idea of writing down only the coefficients and manipu- 
lating them without having to rewrite all the other symbols. We have 
done this to the right of equations (4) through (9). Each rectangular 
array of numbers is an abbreviation for the corresponding pair of equa- 
tions. Rectangular arrays οὗ objects are called matrices. There are six 
matrices above. In each one, the row indicates the equation, and the 
column indicates whether the number is a coefficient of x, y, or is the right- 
hand member of an equation. 

Multiplying all members of a row corresponds to multiplying both 
members of an equation. Adding one row to another, term by term, cor- 
responds to adding the members of one equation to those of the other. 
Hence we may solve the equations by manipulating the matrix until we 
arrive, as in (9), at a matrix of the form 


0 1 12 1 0O ΤΊ 
(10) or 
1 0 τι 01 To 


Then (rj, 72) is the solution. The operations permitted are 


I. Multiplying all elements of a row by the same number. 


II. Adding to all elements of a row a fixed multiple of the 
corresponding element in another row. 


(11) 


111. Interchanging rows. 


Solve the following sentences along the lines of (4) through (9), using both 
methods, and check by graphing and substitution: (Κ) 2. - ὅν =6A zx + 
y = 3, ( 8: — 4y = 10 A —2r-+ ὅν = ll. 


The matrix method applies only to linear equations. Others must be 
attacked by the use of (2) and other laws. For example, suppose we wish 
to find the intersection of the graphs of x -ἰ- 2y = 2 and x? + y” = 9, 
as sketched in Fig. 4~32. Solving the first one for x we get = 2(1 — y). 
Replacing « in the second equation by 2(1 — y), we find 4(1 — y)* + 
y2 = 9. Simplification yields 5y? — ὃν — 5 = 0, from which we have 
y = (4 + V/41)/5. The corresponding values of ὦ may be found by sub- 
stituting in the linear equation. We find in this way that the two unlabeled 
points are ((2 + 2v/41)/5, (4 — v/41)/5), (ὦ -- 2V/41)/5, (4+ V41)/5). 


(m) Express the coordinates as single decimals and verify that the corre- 
sponding points are approximately as suggested by the graph. (n) Sketch 
and solve 2x + y =2A 22+ y? = 10. 
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Break-even 
point, R—C=0 


Figure 4-32 FIGURE 4-33 


PROBLEMS 


Solve 1 through 4 by the matrix method. 


1. 5a -+ 2y = 6, 2. 54 — 4y = 7, 
z—sdy = 1. 2x — ὃν = 10. 
3. 1.5¢ + 3.6y = 10, 4,.x%—y = 5, 
2x —1.2y = 15. —17x-+ ὃν = 1. 


If the sales of a firm are x and the price per unit is p, then the revenue 
from sales & is given by R = pz. If the overhead is ὃ, and the variable cost 
per unit is m, then the cost of producing and selling z units is given by C = 
mz-+ 6b. For definiteness let R = 0.5% and C = 0.252+ 10. Measuring 
& and C along the y-axis, we may graph the two equations as in Fig. 4-33. At 
any particular point along the x-axis, the distance up to the revenue curve gives 
the revenue, the distance up to the cost curve gives the cost, and the vertical 
vector from the cost curve to the revenue curve (R — C) gives the profit. At 
the point where the two curves meet, the profit is zero, and the firm “breaks 
even.” To the left, it loses money. To the right it makes a profit. Such charts, 
called break-even charts, are widely used in business management. 


*%5. Make a break-even chart for R = 1.52%, C = 50-+2, and find the 
break-even point by algebra. 

*6. Do the same for RF = 1.52, C = 25 -᾿ 2z. 

*7. Do the same for R = 1.52,C = 25-+ 1.22. 

*8. Two businesses selling the same product at price p have cost functions 
given by Cy = 200 -Ἐ 0.52 and Ce = 100+ x. Which is in the better position? 

*9. Find a formula for the coordinates of the break-even point in terms of 
p,m, and 6 for R = ρα, Ο' = mz-+ ὃ. 
*10. Show that the break-even point is at the sales where total cost per unit, 
C/x, equals the price. | 
*11. Show analytically that two lines cannot have more than one point in 
common. 
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% 12. Three lines are said to be concurrent if. they have a point in common. 
Find a necessary and sufficient condition for this in terms of the coefficients of 
their equations. 

* 13. Show that the medians of a triangle are concurrent. 

*14. Graph 22 —y+t4=0, S5¢-+ 38y+ 15, and ki(Qe —y+ 4) + 
ko(5a + ὃν + 15) = 0 for (ki, ke) = (1,1), (1,—1), and (2, 8). 

*15. Show that for any k, and ke, the graph of ki(ax-+ by-+c) + 
ko(a’x + b’y + ce’) = 0 is a straight line passing through the intersection of 
az + byte = Oanddz+ δψ -Ῥ οἱ = 0. 

% 16. What happens if the two lines are parallel? 


Find the line through the intersection of 2x — y = 7 and 5ia- 33y = 2 
and 
* 17. having a slope of 1, 
* 18. passing through the origin, 
* 19. passing through (—5, 7). 


% 20. Prove (12). 
(12) ac#0—>{[(a=bAc=a4)< (ας = dbAc = a)]. 


*21. Show that [12 = k?u? +- 22 A 22/(1 — u?) = 2k] -- [235 = χξᾷ — ἐδ}. 
*% 22, Let hy = —C+ 6+ 6, ho = —C— 0+. Find @ in terms of hi 
and hg only. (Astronomy.) 

*23. Prove the identity [(z2 — y)/2]? = [(« + y)/2)? — xy, and show how it 
can be used to solve zy = 1 A x+y =a. (This method appears in a mathe- 
matical treatise of about 300 B.C.) 

*24, Given a(a — ao) = hid, α(β — Bo) = had, a(Y — Yo) = hgd, αὖ + 
B2-+ 7? = 1, and a%+ 62+ 72 = 1, find a formula not involving a, 8, Y 
that gives λ. 


ANSWERS TO EXERCISES 


(a) (6, 5). (b) (1, —2). (c) (6.25, 18.75). (d) 62 — 10y-+ 2 = 0, 
5y = 3a-+ 1, ete. (ὁ) (—1/2,0), (0,1/3) for both. (ὃ Multiply both 
members of first equation by A/C’ and of second by B/C’; then these ratios are 
equal if and only if they are equal to some k for which the three equations hold. 
(g) (2-7-36) and (2-8-3). (h) (α:2, 6:3, ¢:1, 4:0). () Review the comments 
about (2-8-1). (j) Multiply both members of first equation by 2, second by 3; 
subtract second from first, and then divide second by 3; multiply second by 
13; subtract first from second; divide first by —13, second by 26. (k) Check 
by substitution. From now on answers will not be given where this can be done. 
(n) Two solutions. 


ANSWERS TO PROBLEMS 


8. The second for output less than 200, the first beyond that. 11. Show that 
conditions (1) are satisfied. 13. Find midpoints of sides and use two-point form 
to get equations of medians. Solve two of them and show that the third is 
satisfied by the result. Or use the result of Problem 12. 
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4—7 Parallel and perpendicular lines. To investigate the possibility 
of parallelism we attempt to solve Aya + Byy = Cy A 495 + Boy = Co. 
To save writing we use the matrix method. 


A, By Ὁ, 
(1) 
Ag Bo Cs 


ὃς fo B,Be CiBe 
BA B, Bog B,C. 

is oe 0 (6,82 — B,C) 
As Be C2 


(a) Explain how (2) and (3) were obtained. 


The first line of (3) means (A,B. — B,A2)x = (C, Bo — B,C). 
Clearly this gives a unique solution for + unless 4.8. — B,Az = 0. 
Hence, 


(4) Αγ Ἢ Byy = C, A Agr + Boy = Ce has a unique solu- 
tion tf and only if 4.8. — B,Ae ¥ 0. 


What if A,B. — B,Az = 0? Then there is no value of x satisfying 
the first equation of (8) and hence no solution if 6.8. — B,C. ¥ 0. 
Hence, 


Aya + Byy = Cy A Aor + Boy = Co has no solution and 
(5) the corresponding lines are parallel if and only if A,B. — 
B,Ao = 0 a C, Bo more ΒιΟ.-. 


On the other hand, if A,B, — ByAg = 0 = (6.8. — B,C2, then any 
value of x is a solution of the first equation in (3). Hence any pair that 
satisfies the second equation satisfies both, and there is only one line. 
We have found (4-6-1) again. 


(b) Show that 4182 — Bi Az = 0 = C1 Bg — ΒΊΟΣ is equivalent to the 
condition given in (4-6-1). 


The result (5) can easily be given an interpretation in terms of slope. 
If the lines are nonvertical, the equations can be put in the form 


y = (—A1/Bi)x + (Ci1/B1) and y = (—A2/Bo)z + (C2/Bd). 


Now [—A,/B, = ---Α. 89] «Ὁ» [4182 — Bi Az = 0], 
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and IC, /B, x C2/Bo] «- [Ci Be a BC, ΕΖ 0.] 
Hence, 
(6) Two nonvertical lines are parallel τῇ and only 1} their slopes 


are equal and their y-intercepts different. 


Note that if, in addition, the y-intercepts are equal, there is only one line. 
(c) Justify the asserted equivalences. 


When there is no solution, the equations are called inconsistent. When 
there is one solution, they are called consistent and independent. When 
there is an infinity of solutions (coincident lines), the equations are called 
dependent. 


Characterize each of the following pairs of equations precisely, using the 
above terminology; graph; attempt to solve; and check the theorems: 

(d) 25 — By = 4A 22x — ὃν = 2, (e) 2x — ὅν = 4A 2 — 1L.dy = 38, 
(Ὁ 22 -- ὃν = 4A2—1d5y = 2, (g) 2a — ὃν = 4A 2x4 ὃν = 4. 


We can express the condition for parallelism in terms of vectors. We 
recall from Exercise (n) in Section 4-5 that for any vector (7, s) in a line 
of slope m, s/r = γι. Hence from (6), nonvertical vectors (x, y) and 
(x’, y’) lie in parallel lines or in the same line if and only if y/z = y’/z’. 
Assuming y’ τέ 0, this is equivalent to y/y’ = x/x’, or to Iky = ky’ A 
x = kx’. But this last condition says simply that (2, y) = k(2’,y’), 
that is, that one vector is a multiple of the other. In case y’ = 0, we have 
immediately y = 0, (2, y) = (2,0), (2’, ν΄) = (α΄, 0), and again one 
vector is a multiple of the other, since (x, 0) = (a/z’)(a#’, 0). Since any 
vertical vector is a multiple of any other, we have in all cases: 


(7) If u and v are non-null vectors, u and v lre in the same or 
parallel lines if and only if Iku = kv. 
(h) Prove that any vertical vector is a multiple of any other. (i) Why 
“non-null” in (7)? 


We apply (7) to prove analytically the well-known theorem of plane 
geometry that the segment joining the midpoints of two sides of a tri- 
angle is parallel to and of length one-half the third side. Figure 4-34 shows 
the triangle determined by three vectors, U1, U2, ug. We have 0.5ug + 
u, = αι + 0.5u2. Solving for uy and using ug = τὰ + Ue, we have 
u, = u, + 0.5u. — 0.5(u, + ug) = 0.5u,. Since uy is a multiple of 
u,, it is parallel to it. Moreover, by (4-4-5), the eagen of τὰς is one-half 
that of u,, and the theorem is proved! 
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FIGURE 4-34 Figure 4-385 


Under what conditions are two vectors perpendicular? Figure 4-35 
shows (a, δ) L (α΄, δ΄), together with their component vectors. The sum 
of the three angles at P is two right angles. Since 2 TPQ is a right angle, 
it follows that ZSPT and ZRPQ are complementary. But since 2 PST 
is a night angle, ZSP7 and ZSTP are complementary. It follows that 
ZRPQ and ZSTP are congruent. But now we have shown that the two 
right triangles PST and QRP have congruent angles. Hence they are 
similar and their corresponding sic sides are proportional ; that is, ,QR/PR = 
PS/TS. Now, in Fig. 4-35, QR = ὃ, PR = a, TS = b’, PS = - ᾿α’, 
since a’ < 0 and we wish the lengths. Hence we have b/a = —a’/b’ = 
—1/(b’/a’), or (b/a)(b’/a') = —1; that is, the product of the slopes is 
—1. We describe this by saying that the slopes are negative reciprocals. 
Does our argument depend on the particular position of the vectors in 
Fig. 4-35? Yes, it does; but the same result holds regardless of the posi- 
tions of the vectors, provided that neither of them is vertical. 


(j) What happens if one of the vectors is vertical? (k) Justify the algebraic 
manipulations above. (1) Show by rotating two perpendicular vectors that 
we always have either one or three of their components negative, and hence 
that the proportionality of sides of the right triangles always leads to the above 
result. 


We have justified the following theorem. 


(8) ᾿ Two nonvertical lines are perpendicular if and only if their 
slopes are negative reciprocals. 

We can restate the above results so as to avoid the limitation to non- 
vertical lines. The condition b/a = —a’/b’ is equivalent to aa’ + bb’ = 0. 
This condition is also necessary and sufficient when one of the vectors is 
vertical, for if one vector is vertical they are perpendicular if and only if 

they are of the form (a, 0) and (0, δ), in which case clearly aa’ ε bb’ = 
aQ - Οὐ = 0. Hence we have: 
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(9) A necessary and sufficient condition that two non-null vectors 
(a, b) and (a’, b’) are perpendicular is that aa’ + bb’ = 0. 


Which of the following pairs of vectors are perpendicular? Check by sketch- 
ing. (m) (2, 3), (3, me (n) (, 2), (—3, “ul ἢ (o) (2, —6), (3, 1). 

(p) Why did we exclude null vectors in (9)? (q) Show analytically that no 
vector can be perpendicular to itself. (r) Show that ulv—ovtlu. 
(s) Show that uLv— —u tv. (t) Prove analytically that the diagonals 
of a square are perpendicular. 


PROBLEMS 


In Problems 1 and 2 sketch all lines on the same plane. 


l. y = 3x — biorb = 1, 3, —2, 0. 

2.y = mx-+ 3 form = 5, 2,1, 1/3, —2, 0. 

3. Find the line parallel to 54 — 2y = 7 and passing through (0, —1). 
4. Same as in Problem 3 but passing through (—5, 3). 

5. Same as in Problem 3 but perpendicular. to the original line. 


Show 6 through 13 analytically. 


6. If the opposite sides of a quadrilateral have equal length, it is a parallelo- 
gram. 
7. The converse of the previous theorem. 
8. The diagonals of a parallelogram bisect each other. 
9. The locus of points equidistant from two points is the perpendicular 
bisector of the segment joining them. 
10. The altitudes of a triangle are concurrent. 
11. The perpendicular bisectors of a triangle are concurrent. 
12. If a chord of a circle is perpendicular to a radius, it is bisected by it. 
13. The diagonals of a rhombus bisect each other at right angles. 


14. Find the equation of the line through (Χο, yo) perpendicular to the line 
Ax+ Byt+C = 0. 

15. Find the point where the perpendicular in Problem 14 meets the line 
Ax+ BytC = 0. 

16. Use the result in Problem 15 to show that the length d of the perpendicular 
from the point (ro, yo) to Ax + By + C = 0 (..6., the distance between the 
point and line) is given by 


(10) Fees see 2 ely 
VA2+ 82 
17. Sketch 3x — y — 16 = O and the points (5, 1), (1, -- 13), (0,0). Find 
the distance of each point from the line. 
18. Find the distance from the origin of each line in Problems 1 through 6 
in Section 4-5. 
19. Show that (1, 5) and (—1, 1) are equidistant from the line y = 42 -++ 3. 


4-8] THE CIRCLE | 209 


* 20. Write a defining sentence for the set of points within a distance of 2 of 
the line y = x — 4. Sketch the locus. 
*21. What is the distance between the lines y = 81: - 2 and y = 3x — 4? 


ANSWERS TO EXERCISES 


(a) Multiplying first row by Bez and second by Bj; subtracting second row 
from first. (Ὁ) Divide first equation by 4282, the second by CeBe. (c) See 
Iixercise (Ὁ), and use [p< 4] @ [~p <> τα]. (ἃ) Inconsistent. (6) Incon- 
sistent. (f) Dependent. (g) Consistent and independent. (8) (0,6) = 
(b'/b) (0, δ). (1) Vz,y (0,0) = O(2, y). (j) It has no slope. (k) See 
Section 1-9. (1) Draw a series of figures, or rotate a figure consisting of just 
the vectors, observing how the components change. (m) Yes. (n) No. 
(0) Yes. (p) The condition is always satisfied if one of the vectors is null, 
but a null vector has no direction. (4) For (9) would require a? -+- 62 = 0, 
which is true only for the null vector. (r) From (9). (8) From (9). (0) Let 
the sides be (a, 0) and (0, a), so that one diagonal is (a,0) + (0, a) = (a, a) 
and the other is (a,0) — (0,a) = (a, —a). Then the condition (9) becomes 
a? — a? = 0. 

ANSWERS TO PROBLEMS 


1. Parallel lines all of slope 3. 2. Lines with different slopes through the point 
(0, 8). 8. ὅχ — 2y = 2. 5. 2. + 5y+5 = 0. 7. For simplicity let one side 
be parallel to z-axis. Use vectors. 9. Find equation by equating distances from 
fixed points (a, 0) and (6, 0) to (z, y) on the locus. Simplify to show that the 
resulting line is vertical and passes through ((a + 6)/2, 0). 


4-8 The circle. We have seen already that the graph of 
(1) @—h)?+y—kh=r 
is the circle with center (A, &) and radius r. 

(a) Review (4-4-15) and related portions of Section 4-4. 

If we expand (1) and rearrange, we find 
(2) x? + y? — Qhe — My +h? +h? — 7? = 0. 
This is in the form 
(3) a + y?+ δὰ - Εν +F = 0. 


Is any equation of form (3) a defining equation of a circle? To answer, 
let us see whether we can determine ἢ, k, and r in (2) in terms of ἢ), E, 
and F in (3) so that the two equations are equivalent. We wish to have 
ἃ = —2h, H = —2k, and h? + k? — r? = F. Solving the first two for 
h and k, and substituting in the third, we arrive at 


(4) h=—D/2, k= —E/2, 1? = D2/4+ B2/4 — F. 
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Hence for any D, Εἰ, F we can find ἢ, k, and r so that (3), (2), and (1) are 
logically equivalent. Hence, 


The graph of x? + y? + Dx + Εν + F = 0 its the circle 
(5) with center (— D/2, —E/2) and radius r given by r2 = D?/4 + 
E?/4 — F, provided 4F < D? +- ΕΞ. 


We illustrate by finding the center and radius of the graph of x? + y? + 
4x - 2y—-1=0. Here A = 4, B= —2, F = —1. Then ἢ = —2, 
k=1,r?=4+1-+1=6. Hence the center is at (—2,1) and the 
radius is \/6. 


Find the center and radius of the graphs of: (Ὁ) 27+ y?-+ 25 — 4y — 
1=0,and (ὁ) a?-+ y? — 8a+ V2y+2 = 0. 

(d) Why “provided 4F < D*%-+ E?” in (5)?  (e) What is the graph of 
g?2t y2t 2. -- ἀν - ὃ -- ΟἹ (ὃ of 224+ y?+ 2. -- ἀν - 7 = 0? 


Replacing ἢ, k, and r? in (1) by the expressions given in (4), we have 
(6) (x + D/2)? -- (y + E/2)? — D’/4 — E*/4+F = 0. 
Expanding, we find 
(7) 22+ De + D?/4+ y? + Fy + ΕΑ — D?/4 — F7/44+F = 0. 


We see that (7) is synonymous with (8), as expected. Indeed, (7) may 
be obtained by adding and subtracting D?/4 and adding and subtracting 
E?/4 in (3). This observation suggests a method of putting an equation 
of the form (3) into the form (1) without using (4). Beginning with (3), 
we group the terms in x and y together. 


(8) a?+ Dre+ y*+tEy 4t+F=0. 


Then we add a term to make a perfect square out of the terms in z, and | 
compensate by subtracting the same term. 


(9) a? + Dx - (D/2)? + y? + Ey — D*/4+ F = 0. 
We add the square of half the coefficient of x, since 
ω + D/2)? = x2 + 2(D/2)x + D?/4 = 2? + Dx + D?/4. 


We do the same for the terms in y to arrive at (7). From (7) to | (6) is a 
matter of factoring the perfect squares. And (6) is in the form (1). We 
illustrate. Se ον ἢ | 
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(10) g?t 5.1 4¢ -- ἢν -- 1 - 0, 

(11) (Ὁ τά )1 χ᾽ —2+ )-1=0, ἔτι 
(12) (e+ 42+ 4) + (y? -- 2 +1)—-4—-1-1=0, | 
(13) (c+ 2)? 4+ ( -- 1)3 =6. 


The process of adding a term to an expression of the second degree to 
make it into a perfect square is called completing the square. In (10) through 
(13) we completed the square on x and on y. 


(g) Find the center and radius of the circles in Exercises (b) and (c) by com- 
pleting the square. (h) Where did we use completing the square in a previous 
chapter? 


We can now prove geometric theorems and solve geometric problems 
about circles by analytic methods. For example, we can easily prove 
that a straight line meets a circle in two, one, or no points. For suppose 
the circle is defined by (3) and the line by y = mz + ὃ. Solving the two 
simultaneously by replacing y by mz + ὃ in (3), we find 


(14) (1 + m?)z? + Qmb+ D+ Em)x +b? + FbO+F = 0. 


Since this is a quadratic equation, it has two real roots, one real root, or 
none, according as (2mb+ D+ Em)? — 4(1 + m?)(b?-+ Fb+F) is 
positive, zero, or negative. 


(i) Explain the last statement. (7) What do we call the line if there is 
only one root of (14)? (k) Have we covered all possible lines in our proof? 
(1) Find the intersections of x*-+ y? = 9 with y = 2x, and (m) with 
y = 24+ 1. 


Regions bounded by circles play a role in many applications. If we put 
< in place of = in (1), we get an inequality defining the interior of the 
circular region bounded by the graph of (1). 


(n) Why? (0) What if we put > in place of =? (p) <? (q) >? 
(r) Graph x? + y? < 9. 


To illustrate the application of these ideas in economic theory, imagine 
two factories producing the same commodity at the same cost but located 
in different communities. The cost of distributing the product depends on 
the distance of the consumers from the plants. For simplicity, we place 
the axes so that the plants lie at the origin and at the point (a, 0), as 
indicated in Fig. 4-36. For maximum efficiency we wish to have each 
plant supply those customers for whom its transportation costs are 
smaller. Suppose that the plant at the origin (plant P,) spends c, per 


212 PLANE ANALYTIC GEOMETRY | [cHaP. 4 


(a, 0) 
Py 
| FigurRE 4-36 


unit per mile and that the other plant (plant 102) spends cg per unit per 
mile to deliver the product. For simplicity, assume that delivery may be 
made by travel in a straight line, so that for a customer at (zx, y) the cost 
of the delivery of a unit is given by Cy = ον ᾽ξ + y? for the first plant, 


and Cs = ον (x — a)? + y? for the second plant. Accordingly, the 
first plant can supply the commodity more cheaply if and only if Cy < Ca, 
and the second if and only if C; > C2. The points where the cost is the 
same are those satisfying ΟΣ = Ce. Now, Cy = ( 15 


(15) eV a? + y? = σον (ἃ — a)? + y?. 

Since 61 and 65 are non-negative, this is equivalent to 

(16) c? (x? + y?) = c2 (x? — 2ax + a? + y”), 
which simplifies to 

(17) (c2 — c2)u? + (c? — c2)y? + 2ackx — ac} = 0. 
Dividing both members by the coefficient of x? and letting 


s= οἰ — 2), 
we have 


(18) 2? + y? + 2.8. — a*s = 0, 


which is, by (5), the equation of the circle with center at (—as, 0) and 
radius αν 82 + s. Hence the boundary is a circle. One plant should serve 
customers within the circle and the other those outside the circle. The 
location of the circle depends on the values of a, 61, and 62. 


Assume, as in Fig. 4-36, that a > 0. %*(s) Suppose that the first firm has 
the lower transportation costs so that c1 < ca. Show that 8 < 0, —as > 0, 
—as > a, and αν 82 + 8 < —as, so that the center of the circle is to the right 
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of P2 and the second plant serves just its interior, while the first serves all points 
outside. +*(t) Sketch the previous situation for a = 10, cy = 1, co = 2. 
*(u) Draw similar conclusions for the case where οἱ > ce, and sketch this 
situation for a = 10, cy = 2,c2 = 1. κί) What happens when cy = ce? 


PROBLEMS 
In Problems 1 through 10, complete the square and sketch the locus. 


1, a7 + y? — 22+ ὃν — 1 = 0. 

2. 27+ y?+ δὰ — by — 10 = 0. 
3. 27+ y? — Br = 0. 

4. χ2- y2 + Qy = 1. 

δ. Qe? + Qy2 + δα — 13y = 3. 

6. x27-+ y? — 2x — 3.2y = 0. 

7. 8a — 22+ ἄν — y? = 1. 

8. 10 — 2 — y — 2x? — Qy* = 0. 
9, 27+ y? —z4+ 25 = 0. 

10. 22+ y2 + 22 — ἀν = —5. 


*11. Find a formula for the solution of 22+ Azx+ Β = 0 by completing 
the square. 

In Problems 12 through 21, sketch the locus. 

12. 22+ y2 < 18. 

13. 27+ y? > 16. 

14. (2 — 8)2- (y — 2)2 < 4. 

15. 27+ y?+ 22 > —1. 

16. 27+ y2< 0A —-2<2< 2. 

17. χ +t+y< BWA y <2 «1. 

18.4 < 27+ ν «9. 

19. 27+ y2 < 0. 

20. «΄ 1 γ' <0. 

21. 2° + y? > 25: Ay > 2. 


* 22. Show analytically that a tangent to a circle is perpendicular to the radius 
at the point of tangency. 

* 23. State and prove the converse of Problem 22. 

% 24. Show analytically that two circles meet in at most two points. 

*25. Show analytically that the segment joining the centers of two circles is 
the perpendicular bisector of their common chord. 

* 26. Show that the length of a tangent line to the circle F(x, y) = 2? + y? + 
Dz -+ Ey -Ἐ F = 0 from the point (zo, yo) is V F (20, yo). | 

% 27. Prove that for any value of m the lines y = mz + aV1-+ m? are tan- 
gent to the circle x? + y? = αϑ. 

*28. Prove that xor-+ yoy = a? is tangent to the circle 22+ y? = a? at 
the point (xo, yo). 

* 29. Prove that a line through the center of a circle perpendicular to a chord 
bisects the chord. 

*%30. Prove the converse of Problem 29. 
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ANSWERS TO EXERCISES 

(0) (—1, 2), V6. (ὦ (1.5, --ν 2,2), 7/2. (4) If this is not satisfied, we 
have no points on the graph, or only one, according as the right member of the 
equation for 72 in (4) is negative or zero. (6) The single point (—1, 2). 
(Ὁ) Empty. (h) (2-8-14) through (2-8-16).  () This is the discriminant; 
see Section 2-8. (j) Tangent. (k) No, not vertical lines; but the statement 
clearly holds for them, since letting « = c in (2) we have a quadratic for y. 
(l) (8/75, 6/5), (—3/V5, —6/V5). (m) Substitute to check. (n) Points — 
that satisfy it are at a distance from the center of less than the radius. (0) 
Exterior. (p) Interior and boundary. (q) Exterior and boundary. (1) In- 
terior of circle with center at origin and radius 3. 


ANSWERS TO PROBLEMS 


1. (1,-1.5), “4.25. 8. (2.5,0),2.5. 5. (—5/4, 18/4), V218/4. 7. (1.5, 2), 
/21/2. 9. Empty. 11. Check by substitution and compare with quadratic 
formula (a:1, 6:4, ¢:B). 18. Exterior of circle with radius 4. 23. A line per- 
pendicular to a radius at a point on a circle is tangent. Write equation and show 
that it meets the circle in only the point. 25. Extend the idea of Problem 15 
in 4-6 to show that the equation of the common chord is obtained by sub- 
tracting the left members of the equations. 27. Solve with equation of the 
circle and show that there is Just one root. 


4~9 Directed arcs and angles. Imagine a unit circle (circle of unit 
radius) with center at the origin. Consider an axis tangent to the unit 
circle at the point (1,0) with the positive direction upward, as sketched 
in Fig. 4-37. Let this axis be “wrapped around the circle” without stretch- 
ing or contracting, so that lengths are preserved. The idea is not as simple 
as it seems, but we ignore the mathematical difficulties here. We take for 
granted the theorem of high-school plane geometry that tells us that the 
length of the circumference of a circle of radius r is 27r, where 7 is a real 
number (approximately 3.14), the same for all circles. Hence the length 
of the circumference of the unit circle is 27. The interval (0 27) on 
the axis wraps around the circle only once, so that the point 27 goes into 
the point (1, 0). | 

(a) Explain the points indicated in Fig. 4-37. (0) Sketch a unit circle and 
indicate the points into which the following points on the axis are carried by the 
above maneuver: 7/4, 39/4, 4, ---π, —1/4, 34. (ὁ) How many different points 
on the axis go into each point on the circle? 


The circumference of the unit circle is now a kind of map of the real 
numbers and of the axis. To every real number (point on the axis) there 
corresponds just one point on the circumference. To every point on the 
circumference there corresponds an infinite number of real numbers 
(points on the axis). Suppose that the real number @ corresponds to a 
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Figure 4-37 Figure 4-38 


certain point on the circumference. The other numbers that correspond 
are all those obtained from the formula @ + 2k7r, where k is any integer 
(positive or negative), since an interval of length 27 wraps around the 
circle just once. Corresponding to each segment on the axis we have an 
arc of the circle. But again, each are corresponds to an infinite number of 
different segments on the axis. These correspondences are not one-to-one. 

Corresponding to each vector on the axis we have a directed are on the 
circumference. We consider a directed arc to be determined by its length 
and direction, regardless of its position on the circle, just as we consider 
a vector on the axis to be determined solely by its length and direction. 
Then there is a one-to-one correspondence between the vectors and 
directed arcs. Indeed, the arc corresponding to a vector is obtained by 
wrapping the vector around a portion of the circumference. We suggest 
this in Fig. 4-38, where we have chosen a vector of length greater than 
27 in order to illustrate the convention of leaving the wrapping incomplete 
to suggest the length of the directed arc. 

From now on we shall consider only directed arcs and shall refer to them 
simply as arcs. When an arc is placed so that its initial point is on the 
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X-axis, 1t 15 said to be in standard position. Arcs in standard position cor- 
respond to vectors with initial point at the origin. 

Because of the one-to-one correspondence between ares and vectors on 
an axis, and the one-to-one correspondence between these vectors and real 
numbers, we have also a one-to-one correspondence between real numbers 
and ares. Indeed, the are corresponding to a real number may be found by 
starting at any point on the unit circle (the point (1, 0) if we want the are 
in standard position) and going a directed distance on the circumference 
equal to the real number. If θ is the real number, we go a distance |6], 
in the counterclockwise direction if 6 > 0 and in the clockwise direction 
if 6 < 0. Because of this one-to-one correspondence we often use the 
same symbol to stand for a real number and the corresponding arc. 

For each of the following sketch a unit circle and a directed arc corresponding 
to the number: (d) 7m, (6) 27, (f) --Ἰ, (6) —57/2. 

We can interpret algebraic operations on real numbers by arcs on a 
unit circle just as we did by vectors on a line. Indeed, the picture on the 
circle may be obtained by wrapping the corresponding linear picture 
around the circle. In particular, any arc equals the difference of two arcs 
in standard position, the are to its terminal point less the arc to its initial 
point, as illustrated in Fig. 4-39, where 6 = A — B. 

(8) Illustrate the identity a-+ (—a) = 0, using ares on a unit circle. 
(1) Sketch the following arcs: 1, 4.14, —5.28, ---ὃπ, —2.14, 2.14. 

The angle concept in plane geometry is usually rather vague. We 
visualize a figure such as that drawn in Fig. 4-40. If we define a ray as a 
point and all points on one side of it on a straight line, we might define 


Figure 4—40 
C 


FIGURE 4-39 Figure 4-41 
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an angie as a figure consisting of two rays with a common endpoint. The 
rays are called the szdes of the angle, and the common point is called the 
vertex. But the reader recalls that the angle concept involves more than 
this, namely, the idea of how much we would have to rotate one side in 
order to make it coincide with the other. Hence Fig. 4-40 is ambiguous, 
since it does not indicate in which direction we are to rotate. To get a 
precise definition, we define directed angles in terms of directed arcs. 

A directed angle is a figure consisting of two rays (called sides) with a 
common endpoint (called the vertex), a unit circle with the vertex as cen- 
ter, and on this circle a directed arc from one side to the other. The side 
on which the initial point of the arc lies is called the inztial side. The other 
side is called the terminal side. The angle is said to be in standard position 
when its arc is in standard position. We shall usually refer to directed 
angles simply as angles. 

An angle is determined uniquely as soon as its are is given, regardless 
of the position in which the figure is placed. Conversely, an angle de- 
termines a unique arc; there is evidently a one-to-one correspondence 
between arcs and angles. Indeed, the angle is precisely the arc with rays 
drawn from the center through its initial and terminal points. The real 
number that corresponds to the arc is considered to be the measure of the 
_ are and of the angle. Because of the one-to-one correspondence we often 
use names of real numbers to stand for the corresponding angles. Two 
directed angles are called congruent if the undirected angles obtained by 
omitting the directed arcs are congruent. 


Sketch the following angles in standard position: (j) 3, (k) 51/3, (1) --π, 8. 


Among the many applications of these ideas, the most obvious are those 
to are lengths, circular areas, and circular motion. Let c be an are on a 
circle and let @ be the corresponding are on the concentric unit circle, as 
suggested in Fig. 4-41. We say that the arc is intercepted by the angle θ. 
A theorem of plane geometry tells us that the lengths of the arcs are 
proportional to the lengths of the radii; that is, (length of arc) / (a) = r/1. 


C = γ)θ], where C is the length of an arc intercepted by the 
angle @ on a circle of radius r. 


(1) 


(m) A boy is flying a model plane on the end of a string of length 60 ft. If 
the string turns through an angle of 7/6, how far does the plane move? (n) The 
plane moved through a distance of 125 ft. Through what angle has the string 
moved? 


From plane geometry we know that the area of the interior of a circle 
is given by wr”, where r is the radius, and that the areas of circular sectors 
(portions of the interior between two radii) are proportional to their cen- 
tral angles (arcs). Hence, if a sector has an angle of 6, we have (area of 
sector) τὺ = |6|/2z7, or 
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A = (1/2) |6| r?, where A is the area of a sector of central 
angle 9 and radius r. 


(2) 


(0) What area has been swept out by the string in Exercise (m)? (p) in 
Exercise (n)? 

(q) A flywheel is turning at 300 revolutions per minute. If it has a radius of 
6 ft, how many feet per minute is a point on the circumference traveling? 


The above system of measuring angles is called the radian system, and 
an angle is said to be given in radians when we give the real number cor- 
responding to its arc on the unit circle. The system is the most convenient 
one in mathematics because of (1), but other systems are in use. They 
may all be defined by taking some other circle in place of the unit circle 
on which to use directed arcs as measures of the intercepting angles. 

If we take a circle of unit circumference, an arc of 1 is intercepted by an 
angle of 27 radians. If we use arcs on this circle to measure angles, our 
unit is revolutions. For example, an angle of 0.25 revolutions is the same 
as one of 7/2 radians. 

If we take a circle of circumference 360, the angles are said to be meas- 
ured in degrees. Parts of a degree may be given in the decimal notation 
or in minutes (1/60 of a degree) and seconds (1/60 of a minute). 

If we take a circle of circumference 6400, angles are said to be measured 
in mils, a system widely used in the armed forces. 


(r) What is the radius of the circle for the revolutions system? (5) for the 
degree system? (t) Since 1 revolution is 2m radians, x revolutions is 27x 
radians, and 1/2m revolutions is 1 radian. Complete the following table that 
gives equivalents in the different systems. (Note: When no units are given, 
radian measure is understood.) 


Radians Revolutions Degrees Mils 
3 20 1 rev 360° 6400 mils 
(3) 1 ? (180/m)° | (3200/m) mils 
? 


a /180 1/360 rev 1° 
φ vs ? 1 


. 


(u) How many degrees in the angle in Exercise (m)? (Ὁ) in Exercise (j)? 
(w) Express the angles in an equilateral triangle in each of the systems of 
measuring angles. 


PROBLEMS 


In each of Problems 1 through 9 sketch a unit circle with the angle correspond- 
ing to the given real number. 


1. w/5. 2. 2r/3. 3. —l1.1n. 
4. 5. 5. U9. 6. 7/6. 
7. —5r/6. 8. —30/4. 9. 2/9. 
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10. Express each angle in Problems 1 through 9 in degrees. 

11. The sun is about 93,000,000 miles away from the earth. Suppose two 
spots on the sun are separated by an angle of 0.1° when observed from the earth. 
Approximately how far apart are they? 

12. A vector of length 5 rotates around its initial point through an angle 
of 35°. Through how long an arc does its terminal point move? 

* 13. Write a defining sentence of the ray making an angle of 45° with the 
X-axis. 

*14. Write a defining sentence of the ray from the origin passing through (a, ὃ). 
* 15. Do the same for the ray with endpoint (zo, yo) and through the point 
(21, yi). 

*16. Write a defining equation of the angle of Problem 1 in standard position. 
*17. The area of a sector of a circle of radius 31 is 1000. What is the central 
angle in radians? Sketch. 

18. What is a necessary and sufficient condition that two angles are con- 
gruent? 


ANSWERS TO EXERCISES 


(a) Since the whole circumference has length 27, half of it has length 7, 
one-quarter has length 7/2. (0) See Fig. 4-42. (c) An infinite number. 
(d) through (g) See Fig. 4-485.. (h) See Fig. 4-44. (7) Just less than a 
straight angle. (k) 5/6 of a revolution. (1) 1/6 of a revolution in clock- 
wise direction. (m) About 31.4 ft. (n) Just over 2. (0) 3007 ft?, 
(p) 3750 2,  (q) 36007 ft/min. (r) 1/2. (s) 360/27. (0) 1/2z7 rev. 
160/9 mils, 7/3200, 1/6400 rev, (9/160)°. (ἃ) 30°. (v) (540/r)°. (w) 60°, 
1/6 rev, 7/3, 1066.6 mils. 


Fiaure 4-42 


FIGuRE 4-44 FIGuRE 4-43 
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ANSWERS TO PROBLEMS 


1. 1/10 rev. 3. Just over 1/2 rev in negative direction. 5. 3.5 rev. 7. —150°. 
9. 40°. 11. (93,000,000) (0.1) (7/180) = 162,000. 13.y=az7Ax> 0. 
15. 36> 0A (ὦ, y) = (20, yo) + τίσι — 1,41 — yo). 17. About 2. 


4-10 Polar coordinates. The rectangular coordinate system used so 
far is based on a grid consisting of vertical and horizontal lines. A point 
is located by giving the vertical line (#-coordinate) and horizontal line 
(y-coordinate) on which it lies. Many other grid systems are possible. 
Polar coordinates are based on a grid consisting of circles with center at 
the origin and rays issuing from the origin. The circle on which a point 
lies determines its distance from the origin, as suggested in Fig. 4-45. 
The ray on which a point lies tells its direction from the origin, as sug- 
gested in Fig. 4-46, where direction is indicated by an arc on the unit 
circle. It is customary to indicate the distance from the origin by 7, 
often called the polar distance, and the direction by @ (theta), often called 
the direction angle. The x-axis serves as the basic reference line on which 
to indicate the distances and is often called the polar axis. The unit circle 
serves as the basic reference curve on which to measure angles (arcs). 

To be more precise, suppose we are given any two real numbers r and 8. 
We lay out the vector r from the origin along the polar axis, then rotate 
it through an angle (arc) 6. Its terminal point determines the correspond- 
ing point. From the previous section it is obvious that many different 
ordered pairs (r, θ) determine the same point. Figure 4-47 shows the 
point (3, 7/3) and two alternative pairs of coordinates. To determine the 
polar coordinates of a given point, we find its distance from the origin, 
then determine any angle from the positive side of the polar axis to the 
vector from the origin to the point. Other coordinates may be found by 


a/2 
1/3 


3n/2 
Figure 4-45 Figure 4-46 
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y (3, 7/3) 


(3, -5x/3) 
(3, 77/3) 


FIGURE 4-47 Figure 4-48 


adding any integral multiple of 27 to the angle or by taking the negative 
of r and adding 7 to the angle, as suggested in Fig. 4-48. 


Plot the following pairs considered as polar coordinates: (a) (1, 1), 
(b) [3]. 1), (0) (, w/2), (d) (ail, 3/2), (e) (0, 0), (f) (0, 5), 
(g) (3, --2π|,3). 


If we are given an ordered pair of real numbers, say (2, 3), there is no 
way to tell, except from the context, whether we mean the point with 
rectangular coordinates 2 and 3 or the point with polar coordinates 2 and 
3. To avoid confusion we write (a, b), for the point with polar coordinates 
a and 6. With this notation we have the following laws. 


(1) (, 9) p a (=, 6 ΞΡ ΤΠ)» 
(2) (γ, θ)» = (7, 6 + 2k)» (k-integral), 
(3) (0, 9) » as (0, 0). 


(8) Justify (1), (2), and (3). Why must k be integral in (2)? 


Any sentence in r and @ now determines a set of points corresponding to 
the ordered pairs (r, @) that satisfy the sentence. As for rectangular 
coordinates, we call this locus the graph of the sentence. Certain loci 
have very simple defining sentences. 


Describe and graph: (i) r = 2, (j) r = —2, (k) 6 = 7/2, (il) @=—1, 
(m) @ = 3.14. 


222 PLANE ANALYTIC GEOMETRY [cHAP. 4 


In working with sentences involving polar coordinates, we must keep 
in mind that the direction angle θ is always measured in radians. Thus the 
locus defined by 9 — 1 is the line making an angle of 1 radian with the 
polar axis (cutting off an are of length 1 on the unit circle) and not a line 
making an angle of 1 degree or 1 mil. 

Polar coordinates are particularly convenient for defining spirals. Tor 
example, the locus of r = 6 is a double spiral each of whose points has a 
distance from the origin equal to its direction angle, as sketched in Fig. 
4-49, Note that the second spiral corresponds to negative values of r and θ. 


Graph: (n) r= 26, (0) r= (4, (p) r= @. 


PROBLEMS 


1. Suggest a way in which we could limit the ranges of r and 6 to subsets of 
the real numbers so that there would be a one-to-one correspondence between 
points and ordered pairs of polar coordinates. 


In Problems 2 through 11 make a table of values of r and 6 and sketch the 
curve. 


2, γὲ = 1. 3. r2 + 2r —1=0 
4,r> 0. δ. ὃ <r<4 
θβ.ὄ9--Ὑ Δλοκ-κθ « 2z. 7.7 = 80. 

8.r = 6/2. 9,r = 170 
10.r<@6@AD<46< 4. li. r « @. 


*12. Show that when r > 0, (7,0), = (7, @) pe? AK KET A 0 = φ- 2kr. 
(J = the integers.) 


Φ 
(3, --2π,8) 


Figure 4-490 Figure 4-50 
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ANSWERS TO EXERCISES 

(a) through (g) See Fig. 4-50. (8) Adding 7 to the angle is equivalent to 
reversing the direction of r. Any point with r = Ὁ must be at the origin. In 
(2) k must be integral since we arrive at the same point only by complete rota- 
tions, i.e., integral multiples of 27. (1) Circle of radius 2 with center at origin. 
(j) Same. (k) y-axis. (1) Line in quadrants II and IV through origin mak- 
ing angle of about 57° with positive z-axis. (m) Approximately the z-axis. 
(n) Spiral like that of Fig. 4-49 except that it “expands” more rapidly. 
(o) Same as Fig. 4-49 for θ > 0, for 6 < 0, curve is mirror image in z-axis of 
curve for @ > Ὁ. (p) Similar to Exercise (0) but “expands” at increasing rate. 


ANSWERS TO PROBLEMS 
1. Limit 7 to non-negative values and @ to values in (O27), with the condi- 
tion that only @ = 0 is to be paired with r = 0. 3. Two circles, r = —1-+ V2 
andr = —1— V2. 5. Region between two circles. 7. Similar to Fig. 4-49. 
9. Approaches the origin as @ gets large, and approaches the z-axis as @ ap- 
proaches 0. 11. Entire plane except orgin. 


4-11 The sine and cosine, Since every point in the plane determines 
a unique vector from the origin to itself, we can designate a vector by 
polar coordinates of this point as well as by the rectangular coordinates. 
If we limit ourselves to non-negative polar distances, the polar distance 
is simply the length of the vector. A direction angle is an angle from the 
positive z-axis to the vector. In this sense a vector is determined pre- 
cisely by its length and direction, even though many congruent angles 
may be used to indicate the direction. Ordinarily we choose for direction 
angle the smallest number in the interval (027). 

Figure 4-51 shows a vector with components v and y, length r, and direc- 
tion angle θ. We know from (4-4-1) that 


(1) r= V2? + 7. 


Hence we can easily find the length of a vector when we know its com- 
ponents. But how can we find the direction angle? And if we know r and 


y 


v 


FIGuRE 4—51 FIGURE 4-52 
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6, how can we find x and y? Problems of this kind arise in many applica- 
tions. In certain simple cases the questions can easily be answered. For 
example, if 6 = 7/2, then = Oandy = r. If x = 0, then 


b= 7/2 7 OS oT 2 


(or 6 is some angle congruent to one of these) and y= 1 V y= —|] 
correspondingly. If we know that « = 0 A y = --Ἰ, we have immedi- 
ately that r = 1 A @ = 37/2. 

Find the missing coordinates: (a) @= 7, r= 2; (b) = 2, y = 0, 
(ec) cx = —l,y = 0. 

Such simple cases are not limited to those where the terminal point 
lies on one of the axes. Jor example, suppose 6 = 7/4, as illustrated in 
Fig. 4-52. Then clearly the triangle is isosceles, x = y, and 2? + y? = r? 
becomes 2x2 = r?, which implies x = r/V/2 = y. 

Find the missing coordinates: (d) 6 = 39/4, r=1; (0) 2 = y = 2; 
(f)2=ly=—l. 

The reader can soon convince himself that except in such special cases 
(of which we shall consider others later), there seems to be no easy method 
of finding ὦ and y from r and θ or of finding @ when x and y are given. 
Nevertheless, it is clear that « and y determine @ uniquely (in the sense 
that they determine a set of congruent angles) and that 6 and r determine 
x and y uniquely. 

We begin by considering the special case 7 = 1. Then we have a unit 
vector which we imagine placed initially along the positive z-axis. If ° 
we rotate it in the counterclockwise direction, its terminal point covers 
an are θ. To each value of 6 there corresponds a unique x and y. We call 
the value of y corresponding to a given value of 6 the sine of 6 (abbreviated 
sin 6), and the value of x corresponding to a given value of 6 the cosine of θ 
(abbreviated cos 6). The history of these names is of interest. Figure 
4-53 shows the angle 26 with the chord equal to 2y. At least two thousand 
years ago, the problem was posed in terms of finding 2y when 26 was known. 
Because of the resemblance of the figure to a bow, the chord came to be 
called a sinus, a Latin translation of a Sanskrit word for bowstring. The 
term cosine was suggested by the fact that the cosine of θ is the sine of the 
angle 7/2 — 6, called the complementary angle. This is suggested in 
Fig. 4-54. (For formal definitions of sine and cosine, see Problem 34 at 
the end of the chapter.) 


(g) Explain Fig. 4-54. (h) Argue that sin @ = cos (1/2 — 6). 
Tables of sine and cosine. Because of their many uses, it is convenient to 


have tables of the sine and cosine. Table II in the Appendix gives sin @ 
and cos θ approximately for certain values of @ between 0 and 7/2 (1.57 
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i ee (1/2 — @) 


FIGURE 4-54 


Figure 4—55 


approximately). The corresponding angles in degrees are given also. 
Although the table could be constructed by careful measurement of a 
large figure, actually the entries were found by consulting a much more 
accurate table that was constructed by methods beyond the scope of this 
book. 

From the table we can easily find the sine and cosine of any angle. To 
see how this can be done, study Fig. 4-55. There are four points, P;, Po, 
P3, P4, on the unit circle determining four vectors making equal angles 
A with the z-axis. Thus the vector P; makes an angle of A with the 
negative x-axis and its direction angle is 7 — A. Similarly, the direction 
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angles of δα and P, are 7 -+ A and 27 — A. Hence the four right tri- 
angles determined by the vectors and their components are congruent. 
Therefore the sines and cosines of these four angles have the same absolute 
values. We see that the sine and cosine of any angle have the same ab- 
solute values as those of some angle in the first quadrant, this angle being 
the smallest non-negative angle between the corresponding vector and the 
z-axis. Then we determine the sign by noting the quadrant in which the 
angle lies and seeing whether its sine and cosine are positive or negative. 

lor example, to find the sine and cosine of 2.17 (145.6°) we note that 
2.54 = 3.14 — 0.60, and hence that the angle lies in the second quadrant, 
similarly to point P, in Fig. 4-55. Hence its sine and cosine are the same 
in absolute value as those of 0.60, that is, approximately 0.56 and 0.83 
from Table II. But we can see from the figure that the sine and cosine of 
an angle in the second quadrant are, respectively, positive and negative. 
Hence sin 2.54 = 0.56 and cos 2.54 = —0.83. 


Find sine and cosine of: (4) 2.64, (j) 3.94. 


Consider the reverse problem: given the sine and cosine, to find the 
angle. To do this we sketch the angle by locating the unit vector with 
z- and y-components respectively equal to the given cosine and sine. Then 
we find the angle between this vector and the z-axis by looking in Table II 
under the absolute values of the given sine and cosine. The desired angle 
is then found by reference to Fig. 4-55. For example, suppose cos 6 = 0.70 
and sin @ = —0.72. The vector is sketched in Fig. 4-56. From Table IT, 
A = 0.80 (45.8°). Hence 6 = 27 — 0.80 = 6.28 — 0.80 = 5.48. Note 
that we need both sine and cosine. If just sine or cosine were given, there 
would be two angles in the interval (0 2π). Of course, the angle found, 
plus any integral multiple of 27, is a congruent angle with the same sine 
and cosine. 


(0.70, —0.72) 


FIGURE 4-56 FIGURE 4-57 
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Find 6 whose sine and cosine are: (k) —0.91, 0.42, (1) —0.20, —0.98. 
Find two positive angles less than 27 whose (m) sine is 0.50, (n) cosine 
is —0.57, (0) sine is —0.91. 


Having dealt with the problem for unit vectors, we find it easy to handle 
others, for any vector is a multiple of a unit vector. If we are given a vec- 
tor (7, 4)», its components are evidently r cos θ and r sin 6, as sketched in 
Fig. 4-57. This enables us to find immediately the components of any 
vector from its length and direction angle. On the other hand, it is evident 
that given (zx, y) we can find the length from (1) and a direction angle by 
finding an angle @ such that cos 6 = 2/r and sin 8 = y/r. We may sum- 
marize by 


(2) [(ὦ, ψ) = (7, θ)ρ] © [2? + y? = r? A cos 6 = 2/r A sind = y/r]. 


These relations may be used to go from (2, y) to (r, 4)», or from the latter 
to the former. 


Find the components of the vector (p) of length 3 and direction angle 60°, 
(q) of length 10 and direction angle 4.54. (r) Find a direction angle of 
(2, —38.5). 


Another way of formulating the problem of this section is to ask for 
polar coordinates of a point whose rectangular coordinates are given, or 
vice versa. Hence this problem is solved also by (2). | 


(s) Find polar coordinates of (—1, —1). (Ὁ) Find the rectangular coordi- 
nates of (2, —3m/4)>. 


Interpolation. Table II may be used with angles, sines, or cosines that 
do not appear in it. Suppose, for example, that we wish the sine of 0.54. 
We have sin 0.50 = 0.48 and sin 0.60 = 0.56. Since 0.54 is 4/10 of the 
way from 0.50 to 0.60, we estimate that sin 0.54 is 4/10 of the way from 
0.48 to 0.56. The total distance being 0.08, we find (4/10)(0.08) = 0.03, 
and estimate that sin 0.54 = 0.51. This process is called linear interpola- 
tion. It is based on assuming that the changes in @ and sin @ or cos 6 are 
proportional. The assumption is not true, but the process gives sufficiently 
accurate results if the differences are not very large. 


Estimate: (u) cos 0.54, (v) sin1.32,  (w) sin 5.30. (x) Find sin 0.54 
by using sin 0.52 and sin 0.60. 


Linear interpolation can be used in reverse to estimate an angle from 
its sine or cosine. For example, suppose sin θ = 0.74. We have sin 0.80 — 
0.72 and sin 0.87 = 0.77. Since 0.74 is 2/5 of the way from 0.72 to 0.77, 
we estimate 6 to be 2/5 of the way from 0.80 to 0.87, that is, 9 = 0.80 + 
(2/5)(0.07) = 0.80 + 0.03 = 0.83. 


(y) Find approximately the angle in the first quadrant whose sine is 0.88. 
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The reader may find it convenient to arrange in a table the numbers | 
involved in an interpolation. We do this for the examples above. 


θ sin 6 
| 0.50 0.48 | 
4 h 
(3) 10 0.84 2 Lig 
0.60 0.56 J 
sin 6 θ 
| 0.72 0.80 | 
2 h 
(4) 5| 10.74 ? 7 
0.77 0.87 J 


In each case h is the unknown distance to be estimated. In (3) we have 
h/8 = 4/10 or ἃ = (2/5)8. In (4) we have h/7 = 2/5 or h = (2/5)7. 
Note that the differences have been multiplied by 100 to avoid decimals. 
Common sense dictates placement of the decimal point in the result. 


(z) Tabulate as in (3) and (4) your interpolations in Exercises (u) through (y). 


PROBLEMS 


1. A plane flies 20° north of east for 90 min at 200 mi/hr. How far east and 
north of its origin is it? 

2. A plane is 100 mi west and 250 mi south of its starting point. What is its 
distance and direction from the starting point? 

3. Sketch four angles between 0 and 2π whose sine in absolute value is 0.5. 

4, Sketch four angles between 0 and 2m whose cosine in absolute value is 0.75. 


In Problems 5 through 13 find the rectangular coordinates of the point and 
sketch. 

5. (1, 605). 6. (3, 2). 

8. (2.5, 2π,3). 9, (δ, 3105)». 

11. (100, 5.1)». 12. (255, 345°),. 


In Problems 14 through 16 find the components of the vector with length and 
direction angle as given and sketch. 


14, 15, 5/4. 15. 500, 215°. 


In Problems 17 through 20 find the length and direction angle of the vector 
and sketch. 
17. (—5, 3.4). 
19. (—2, —2). 


7. (—1, 455)». 
10. (4, —32/4)>p. 
13. (24, —93°)». 


16. 1.2, 3.7. 


18. (100, 3). 
20. (1000, 2000). 
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cos r/3 


δὰ 


FIGuRE 4-58 FIGURE 4-59 


21. Figure 4-58 shows an angle of 7/3 (60°). By connecting the terminal 
point of the are with (1, 0), we form an equilateral triangle whose altitude has 
length sin 7/3. From this find cos 7/3 and sin 7/3. Check against Table ITI. 

22. Similarly, find the sine and cosine of 1/6, 24/3, 54/6, 71/6, 44/3, 53/3, 
and 1117/6. Make drawings. Check your results against Table II. 

23. Make a table showing the values of sine and cosine for 0 and all multiples 
of 7/6 and 7/4, giving exact expressions for each value. 


It is an empirically established law of physics that if forces be represented 
by vectors (the length representing the magnitude of the force, the direction 
representing the direction of the force), two forces applied at a point have the 
same effect as a single force (called the resultant) corresponding to the vector 
sum. In brief, forces add like vectors. This law is all that is needed to solve 
Problems 24 through 29, 


* 24, Suppose a force of 60 Ib is exerted in a direction 50° above the horizontal. 
What two forces in the horizontal and vertical directions would have the same 
effect? | 

*25. A block is dragged along a plane by a rope making an angle of 30° with 
the plane. What force on the rope is required to yield a force of 20 Ib in the 
direction of motion? (See Fig. 4-59.) 

* 26. Two forces, one of 20 lb in the direction 140°, the other of 50 Ib in the 
direction —80°, act on a body situated at the origin. Find the magnitude and 
direction of the resultant. (Suggestion: Resolve each force into its components, 
add according to (4-3-1), then find the length and direction of the sum.] 

* 27. Forces are said to balance or be in equilibrium when their resultant is the 
null vector. What force must be added to those in Problem 26 to create equilib- 
rium? 

Ἂ 28. Find the resultant of three forces: (10, 300°); (—25, 200°); (5, 90°). 

*29. A force eastward of 20 lb, a force northward of 50 lb, together with a 
third force, just balance a force of 100 lb in the southwest direction. What is 
the third force? 


* 30. From a plane 10 mi above the earth the angle between the line of sight 
to the horizon and the vertical is 85.6°. From this calculate the radius of the 
earth. 
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*% 31. In one of Leonardo da Vinci’s notebooks appears the following: “If you 
would ascertain the exact distance of the breadth of a river proceed as follows: 
plant a staff upon the river bank at your side and let it project as far from the 
ground as your eye is from the ground; then withdraw yourself as far as the 
span of your arms and look at the other bank of the river, holding a thread 
from the top of the staff to your eye, or if you prefer it a rod, and observe where 
the line of sight to the opposite bank meets the staff.” Draw a sketch and write 
a formula for calculating the width. 

*32. Figure 4-60 shows a body exerting a force of 50 lb hanging from cables 
attached to a horizontal rigid support at A and B. From the dimensions, find 
the forces exerted on the support at 4 and B. 


GY A, 2, Cy 8, 


50 lb 


FigurE 4-60 


* 33. A soldier walks 1 mi 30° east of north, 2 mi 50° south of east, 1 mi 40° 
west of north, and 5 mi 45° south of west. Where is he? 
* 34. We could have defined sin @ formally as follows: 


(5) sin? = [ν 9 dx (a, y) = (1, Aol. 


Formulate a similar definition for cos 6. Justify and explain these definitions. 
«35. Show that the equation 7 cos τ 1 is the equation of a straight line by 
transforming it to rectangular coordinates. 

* 36. What is the polar graph of r.= 1/sin 6? 

*37. In Table II, sin @ = θ for small θ. How does this happen? 


ANSWERS TO EXERCISES 


(a) (—2,0). (b) , 0). (ὁ (mp. (ὦ (—1/V2, 1/V2). (e) 2V2, 
m/4)> (ὃ (72, 74/4)p. (6) The two triangles formed by the vcetors with 
angles @ and 7/2 — @ are congruent since they have hypotenuses of equal length 
and an equal angle. Hence the sine of r/2 — @ is the same as the cosine of @. 
(h) Similarly from Fig. 4-54. (i) 2.64 = 3.14 — 0.50. Hence |sin 2.64| = 
sin 0.50 = 0.48 from Table II. But the sine is positive in quadrant 11. Hence 
sin 2.64 = 0.48. Similarly, [cos 2.64 = 0.88, but the cosine is negative in 
quadrant II, and hence cos 2.64 = —0.88. (j) 3.94 = 3.14 + 0.80; —0.72, 
—0.70. (k) sin 1.13 = 0.91, cos 1.13 = 0.42. Since sine is negative and 
cosine positive, angle is in fourth quadrant. Hence @ = 2π — 1.13 = 6.28 — 
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1.18 = 5.15. (1) 3.3. (m) 0.52, 2.62. (n) 2.58, 3.75. = (0) 4.27, 5.15. 
(p) (1.5, 2.6. (q) (—9.85, —1.7). (Ὁ —1.08. 6) (2, 5ar/4)p. 

(Ὁ) (-vV/2, —vV/2). (ἃ) 0.86. Note that the cosine is decreasing so that 
we go 4/10 of the distance from 0.88 to 0.83, that is, of a directed distance 
—0.05. (v) 0.965. Ordinarily we keep no more decimal places in interpolation 
than are in the table, but here the differences are very small. (w) sin 5.30 = 
—sin 0.98 = —0.83. (x) 2/8 of the way from 0.50 to 0.56, or 0.515, which 
might be rounded off to either 0.51, as found before, or to 0.52. (υ) 1.09. 
(z) Note that the differences are negative in some cases, since they are directed 
distances. For proportionality, the arrows must all go the same way on the 
table. 


ANSWERS TO PROBLEMS 


1. 282 mi east, 102 mi north. 3. 30°, 150°, 210°, 330°. 5. (0.50, 0.87). 
7. (—0.71, —0.71). 9. (3.20, —3.85). 11. (88, —92). 13. (—1.2, —23.98). 
15. (—410, —253). 17. (6.1, 2.58). 19. (20/2, 54/4)p. 23. See (6). 


(6) 


11π76 


4-12 Trigonometric identities. From the definitions οὗ sin θ and cos @ 
as coordinates of a point on a unit circle, it follows immediately that 
(sin 6)? + (cos 6)? = 1. Writing sin?@ τὰ (sin 6)? and cos?@ --- (cos 6)”, 
we have the identity 


(1) sin?@ -+ cos?@ = 1. 


(a) Verify (1) for selected values of θ, including some from Table II. 


The sine and cosine equal the ratios y/r and x/r, where r = |(z, y)|. 
It is natural to give names to other ratios of x, y, and r. 


(2) Def. tan @ = sin 6/cos @, 
(3) Def. cot 6 = cos 6/sin θ, 
(4) Def. sec 9 = 1/cos θ, 


(5) Def. | esc @ = 1/sin @. 
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A b 


Ω 


FigurE 4-61 FiGgurE 4-62 


The symbols on the left of definitions (2) through (5) are abbreviations 
for tangent 0, cotangent 0, secant θ, and cosecant 0, respectively. The names 
have a geometrical significance suggested by Fig. 4-61. There, by similar 
triangles, CB/OC = sin 6/cos @ = tan 6. But OC = 1. Hence tan é 
is just the length of the tangent from C to the terminal side of the angle @. 


(b) Show that cot @ = ED, sec@ = OB (the length of the secant), and 
esc 6 = OD. (ὁ For what values of @ is tan@ undefined?  (d) cot θ᾽ 
(6) sec 0? (ὃ ese @? (8) In Table II verify that the values in the tangent 
and cotangent columns conform to (2) and (8). (h) Argue for the identity 
cot @ = 1/tan@. (i) Show that the slope of a straight line is the tangent of 
any direction angle of any vector lying in the line. 


With definitions (2) through (5) we have a name for the ratio of any 
two sides of a right triangle. Figure 4-62 shows a right triangle with sides 
and angles as indicated. If we imagine this triangle placed with angle A 
at the origin and the side AC along the x-axis, we see that 


sin A = a/c = opposite side/hypotenuse, 
cos A = b/c = adjacent side/hypotenuse, 
tan A = a/b = opposite side/adjacent side, 
” cot A = b/a = adjacent side/opposite side, 
sec A = c/b = hypotenuse/adjacent side, 
esc A = c/a = hypotenuse/opposite side. 
These relations make the tables of these ratios very useful in finding 
sides or angles of right triangles. For example, suppose we wish to find the 


distance across a lake, as pictured in Fig. 4-63. We sight from C’ to a 
point B on the opposite shore. Then we measure 100 ft at right angles to 
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B 


ae 
C 


Figure 4-63 


A 100 


this line of sight to the point A. We find by sighting to B again (and using 
a protractor or some more accurate surveying instrument) that the angle 
at A is 65°. Then we have a/100 = tan 65° or a = 100 tan 65°. From 
Table If tan 65° = 2.1. Hence a = 210 ft to the nearest 10 ft. 

Finding the remaining sides and angles of a triangle when some of them 
are given is called solving the triangle. It was for solution of such problems 
that the ratios of sides were originally given names and tabulated. The 
name “trigonometry” for this branch of mathematics comes from Latin 
words meaning “triangle measurement.” However, trigonometry ex- 
panded far beyond the original purpose. The relations in (6) have mean- 
ing only for angles less than 7/2, whereas we have defined sin @ for all 
real numbers. Many of the most important applications of trigonometry 
have nothing to do with triangles. 


(1) Solve the lake problem if the base line AC is 50 yd long and the angle A 
is 34°, 


Numerous identities connect the six trigonometric ratios. Some are 
suggested by Figs. 4-61 and 4-62. For example, 


(7) 8720 + 1 = sec?6 
(8) 1 + cot?¢ = esc76 
are suggested by considering the right triangles OCB and OED in Fig. 4-61. 


(k) Prove (7) by dividing both members of (1) by cos76. (1) Prove (8). 


Further identities can be constructed without limit by manipulating 
the definitions and the above identities. If the reader did Problems 13 
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through 20 in Section 2-12, he has already proved a number of these, for 
(2-12-1) through (2-12-5) are just (1) through (5), where S, C, T, ἐ, ς, 
and s are abbreviations for sin 6, cos 6, tan 6, cot θ, sec 6, and esc 6. These 
identities are often useful for simplifying trigonometric expressions. 


Simplify: (m) sin26/(1 — cos 6), (n) (sin zsec x)/{tan x cot 2). 

The following identities, which are not so evident as (7) and (8), play 
a very important role in trigonometry. 
(9) sin (6 + ¢) = sin θ cos ¢ + Cos @sin 4, 
(10) cos (@ -+- ¢) = cos @cos φ — sin @ sin ¢. 


(0) Show by a counterexample that sin (@-++ @) = sin @- sin Φ 15 not a law. 


In order to prove (9) and (10), we find it convenient to prove first 
(11) cos (9 — ¢) = cos θ 608 ¢ + sin @sin ¢. 


For this purpose we visualize the angles @ and ¢ in standard position in 
Fig. 4-64. We have chosen ¢ in the second quadrant and @ in the third, 
but everything in the proof remains the same regardless of the size of the 
angles involved. By definition, the coordinates of P, the terminal point 
of the unit vector with direction angle ¢, are as indicated. Similarly, the 
point Q has coordinates (cos @, sin @). The directed are PO is 6 — ¢. Now 
we rotate the angle θ — ¢ (consisting of the vectors OP and OQ, and the 
arc PQ) around the origin until OP lies along the z-axis. Then P goes into 
the point (1, 0) and Q goes into Q’. Since Q’ is the terminal point of a vec- 


υ 


Q’: (cos (θ — φ), 810 (θ᾽ — φ) 


P: (cos ¢, sin ¢) 


θ-- φ P’: (1, 0) 


@: (cos @, sin @) a 


FIGURE 4-64 
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tor with direction angle 6 — φ, its coordinates are, by definition, the cosine 
and sine of -- ¢. Now we assume that this rotation has not changed the 
straight line distance between the point P and Ὁ, 1e., that P’Q’ = PQ. 
But 


(12) PQ? = (cos 6 — cos ¢)” -Ε (sin 9 — sin ¢)’, 


(13) P’Q’* = (cos (6 — ¢) — 1)? + (sin (6 — φ) — 0)". 


Expanding the right members, simplifying by using identities, and setting 
the two results equal, we find (11). 


(p) Do this algebra! 


Now the derivation of (9), (10), and many other identities, is easy if 
we make use of our knowledge of the sine and cosine of special angles such 
as 0 and 7/2. First (11)(@:7/2) is 
(14) cos (7/2 — φ)ὴ = cos7/2 cos ¢ + sin 7/2 sin ¢. 

But cos 7/2 = 0 and sinw/2 = 1. Hence we have proved 
(15) cos (7/2 — φΦ) = sing. 
(q) From (15) by (¢:1/2 — 9), derive (16). 


(16) sin (7/2 — ) = cos φ. 


Now (15)(¢:6 + ¢) yields (9) if we note that 7/2 — (@+¢) = 
(7/2 — 6) — and use (11). 


(r) Do this! 


Similar devices enable us to prove 
(17) cos (—¢) = cos φ, 
(18) sin (—¢) = —sin ¢. 
With these results we can derive (10) by writing 
cos (8 + Φ) = cos (@ — (—¢)) 


and using already proven identities. 


(s) From (9) derive a formula for sin (@ — φ) by noting that @ — @ = 6+ 
(--φ). 
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The entire theory of trigonometry may be derived from the identities 
proved above. A number of important results are included in the exer- 
cises. These laws are important for their role in mathematics, and their 
proofs provide good practice in heuristic (Section 2-12) and in methods 
of deriving identities (Section 2-8). 


PROBLEMS 


1. What two angles in (0 2m) have tangent ᾿ 

2. Draw a diagram and show that tan 7/4 = 

3. A vector has z-component —2 and ee angle 125°. What is its 
length and y-component? 


In Problems 4 through 7 find the missing elements from among 2, y, 7, and 
6 of each vector. 


4,y = —3,0 = —39°. 


Ds 20 2 = 3; 
6.r=4,y = —6. 7. 2 


—10, @ = 237°. 


8. Velocities may be represented by vectors whose lengths give the speeds 
of motion and whose directions give the directions of motion. A projectile is 
moving at an angle of 40° with the horizontal. Its height above the ground 18 
decreasing at the rate of 50 ft/sec. What is its speed? What is the horizontal 
component of its velocity? 

9. Prove that [P sin θ = mv2/R A P cos @ = mg] — [tan @ = v?/Rg]. (Sears 
and Zemansky, University Physics, p. 105) 

10. Why would it be sufficient to replace Table II by one covering only 
angles in the range (0 2/4)? 

11. Argue for the plausibility of (17) and (18) by drawing a figure showing 
angles ¢ and —¢@ in standard position. 

12. Prove (19), (20), and (21). (Note: These properties are described by say- 
ing that the sine and cosine are periodic of period 2m and that the tangent is 
periodic of period 7.) 


(19) sin (@-+ 27) = sin 9, 
(20) cos (8+ 27) = cos 6, 
(21) © tan (θ -ἰ- 7) = tan @. 


13. The following are called the double-angle formulas. Derive them from 
(9) and (10). 


(22) sin 20 = 2sin @ cos 8, 
(23) cos 20 = cos2@ — sin?@ = 1 — 2sin?0 = 2cos7@ — 1. 
14. Use (23) to derive 


(24) sin? = +V (1 — cos 26)/2, 
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(25) cos? = +V (1 -+ cos 26)/2. 


15. Using (4-11-6), find an exact expression for sin 15°. 

16. How does one choose which sign to use in (24) and (25)? 

17. Verify (24) and (25) for selected values in Table II. 

18. By using the definition of tan@ and previous identities, derive the 
following. 


(26) tan (θ — Φ) = (tan @ — tan ¢)/(1 + tan θ tan 9), 
(27) tan ¢/2 = +V (1 — cos ¢)/(1-+ cos ¢) = (1 — cos ¢)/sin φ. 


19. Find formulas for tan (@ + Φ) and tan 2¢. 


20. Prove 
(28) sin + sing = 2sin(- ioe | 
(29) sin@ — sing = 2cos (“+*) sin ("= *), 
(30) cos 0+ cos. = 20s (“ +) co ((Ξ-2 
(31) cos 9 -- cos φ = 25η (“ +2) si n (=) 


*21. Show that Va,b 34,6 asin θ -ἰ- bcos? = Asin (6+ φ). 

22. Show that tan (7/2) = esc x — cot 2. 

23. A plane is flying on a course 80° east of north at an air speed of 200 mi/hr. 
The wind is blowing from the south at a speed of 45 mi/hr. What are the 
actual ground speed and direction of flight? 

24. Prove that sec @ = +V1-+ cot?6@/cot θ. 


Prove the following identities. 


25. cos A/(1 -+ sin A) = (1 — sin A)/cos A, 

26. (cot x + cot y)/(tan x + tan y) = cot zx cot y, 
27. tan 0/2 = sin θ7.(1 -Ε cos 6), 

28. tan(—8) = tan 6. 


ANSWERS TO EXERCISES 


(b) ED/OE = sin 6/cos 6; but OE = 1. (c) Odd multiples of 1/2. 
(d) Multiples of 7. (6) Same as tan@. (f) Same as cot θ. (h) a/b = 
1/(b/a). (i) Imagine the origin at the initial point of the vector (zx, y); then 
tan θ = y/x = the slope of the line. (j) 34 yd. (1) Divide both members 
of (1) by sin? 6. (m) Multiply numerator and denominator by 1+ cos 0: 
final result is 1-+ cos@ (n) tana. (0) θ -- φ = 7/2. 
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ANSWERS TO PROBLEMS 


1. 45°, 2255, 3. y = 2.8, r= 3.5. 5. y = 2.2, θ = 48° or y = —2.2, 
θ = —48°. 7. y = —16,r = 19. 13. (6:0). 15. 0.5V2 — V8. 21. asin é+ 
bcos @ = A[(a/A) sin 6+ (b/A) cos 6], where A = Va? + b?; then let d = a 
direction angle of the vector (a, ὃ) so that cos 6 = a/A,sin@ = 6/A. 26. Mul- 
tiply both members by the denominator of the left member. 


*4-13 Angles and the dot product. We can now derive a formula 
giving the angles formed by any two lines. We first define the inclination 
of a line as the smallest non-negative angle through which the z-axis 
must be rotated to be parallel with the line. As indicated in Fig. 4-65, 
the inclination is a real number in the interval (07). It follows from the 
definition and Exercise (i) in Section 4-12 that the tangent of the incli- 
nation is the slope of the line. 


(a) Explain and justify the last statement with the aid of a diagram. 


We define the directed angle from a line L, to a line Lz as @2 — $1, where 
φι and ¢2 are the inclinations of the lines. It is an angle through which 
L, may be rotated to coincide with Ly. Now we have tan (¢2 — ¢1) = 
(tan dg — tan ¢1)/(1 + tan φ: tan ¢2) from (4-12-26). But tan φ1 = m1, 
the slope of 1.3, and tan ¢2 = me, the slope of Lz. Hence 


(1) tan (62 — $3) = Tom ms 

Suppose for example, that m; = land m2 = 2. Then tan (¢2 — φ1) = 
1/3 = 0.33 and ¢2 — ¢, = 18°. Hence the angle from the first to the 
second line is about 18°. That is, if we rotate the first line through 18°, 
it will coincide with the second. Of course we could also rotate it through 
—162°, and tan (—162°) = tan 18°, so that both angles are given by (1). 


C: (ὃ cos A, ὃ sin A) 


A: | (0, 0) € B: (c, 0) 


FiGuRE 4—65 Figure 4-66 
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(Ὁ) Sketch the lines y = x-+ 1 and y = 2x — 3, showing the angles just 
found. (0) What is the angle from the line y = —x-+ 4 to the line y = 
--8. -᾿ 6? (d) Derive (4-7-6) from (1). (6) Derive (4-7-8) from (1). 


Let ABC be a triangle with angles A, B, C and opposite sides a, ὃ, and c. 
We place the triangle with the angle A at the origin and the side AB along 
the positive x-axis. Then, as indicated in Fig. 4-66, the direction angles of 
the vectors AB, BC, and AC are, respectively, 0, 7 — B, and A. Since 
the lengths of the vectors are c, a, and b, respectively, we have from 
(4-11-2), AB = (c,0), BC = [acos (τ — B), asin (x — B)] = 
(—acos B, asin B), and AC = (6cos A, bsin A). Since AB + BC = AC, 
we find 


(2) (c — acos B,asin B) = (bcos A, ὃ βίη A). 


(f) Prove that cos (π — B) = —cos B and sin (w — B) = sin B. 


From (2) we have immediately asin B = bsin A or (sin A)/a = 
(sin B)/b. Since we could have labeled the sides and angles differently, 
this relation must hold for any pair of sides and corresponding angles of a 
triangle. We have derived the law of sines. 


snA sinB_ sinC 
(3) a b- ¢ 


(Law of sines). 


Again from (2) we have c -- acosB = beos A or c= acosB+ 
bcos A. But since we could have labeled the figure differently, we may 
write ὦ = ccos A + acosC and a= bcosC + ccosB. Multiplying 
both members of these equations by c, b, and a, respectively, we have 
c*=accosB+becos A, 67 = becosA+abcosC, and a? = 
ab cosC’+ accos B. Subtracting the last two from the first, we find 
c? — b? — a? = —2abcosC. We have derived the law of cosines. 


(4) c? = a? +b? — 2abcosC ([μιν of cosines). 


(g) Show that (4) reduces to the Pythagorean theorem when the triangle 
is a right triangle. (ἢ) In a certain triangle, two sides and the included angle 
are, respectively, 2, 3, and 25°. Find the third side from (4). Then find the 
other two angles approximately by using (3). (i) In a certain triangle, 
A = 35°, B = 40°,b = 10. Find C, a, and ὁ by using (3). 


As illustrated in the exercises, the laws of sines and cosines are useful for 
solving triangles. The interested reader will find details for the best tech- 
niques for using them and other formulas for that purpose in books on 
trigonometry and surveying. The laws also appear occasionally in scien- 
tific literature, and the law of cosines may be used to derive another 
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formula for finding the angle between two lines. Indeed, let uy = (1, y1), 
Us = (Xo, Yo), and 5 — uy = (%2 — 2X1, Ye — ¥1) form a triangle, as 
in Fig. 4-67. By the law of cosines 


[ug — uy]? = [uy]? + [5 — 2[πι| [ua|cos ¢, 
where ¢ 15 the angle between the vectors u,; and uy. Using (4-4-1), we find 
(xg -- αι)" + (y2 — 1)? 
= aft υἱ Ὁ αβ +o} — QV οἱ + νῦν οὐ + yh cos 6. 
Expanding and simplifying, we find 


X1L2 + Ψι},2 


(5) cos Φ = SSS 
Var + yi να + υἹ 


(j) Carry through the algebra! (k) Derive (4-7-8) from (5). ([) Derive 
(4-7-6) and (4-7-7) from (5). 


Formula (5) can be simplified by the definition of a kind of multiplica- 
tion of vectors. We define the dot product u-v of two vectors u and v 
to be the sum of the products of the corresponding components; that is, 


(6) Def. (v1, Yr) * (2, Yo) = T1Le + YriY2. 
(m) Prove (7). 
(7) ju) = Vu-u. 


Then (5) may be written 
11. 0 


(5’) cos ¢ = Taal [tal 


In words, the cosine of the angle between two vectors is their dot product 
divided by the product of their lengths. 


Ug — Uy 


Projection 


FIGuRE 4-67 FiGuRE 4-68 
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(n) Show that 
(8) (α; L ug) <> (ἃ, τα = 0). 


The last result is one that makes the dot product very useful. 
_Another useful property results from writing (5’) in the form 


(9) Uy°Usg = [α1] |U2|cos φ, 


from which we see that the dot product of two vectors is the product of 
their lengths and the cosine of the angle between them. 

A vector may always be expressed as the sum of two perpendicular 
vectors, one of which 15 parallel to any given vector. As in Fig. 4-68, we 
can simply drop a perpendicular to the second vector from the terminal 
point of the first and so determine the two perpendicular vectors, which 
are then called components. It is said that the given vector u, has been 
resolved into the two components. The vector parallel to the second vector 
is said to be the projection of uy on Ug. 

Now, as suggested in Fig. 4-68, [πα cos ¢ is in absolute value the length 
of the projection of u, on ue, being positive or negative according as Φ 
is less than or greater than 90°. Hence if we multiply a unit vector in the 
direction of ug by |u;| cos ¢, we get the vector projection of uy on wus. 
Such a unit vector is u2//ug|. Hence the projection of uy on ue, that is, 
the component of ἃ: in the we direction, is [|u,| cos ¢][ue/|ugl], or by (9), 
(Uy 12... Ug) Ud. 


(10) The projection of 1 on Us ts ee 


Uo. 
U2°U2 


This result enables us to find the component of a vector in any direction 
without using trigonometry. We need know only a vector in the desired 
direction. To illustrate the formula in a trivial case, the component of 
(2, y) in the direction of (1, 0), the z-axis, is 


(1+ a2+0-+y)/1)(1, 0) = (a, 0), 


as expected. 


(0) When the projection of uy on ug has been found from (10), how can the 
component perpendicular to it be found? (p) Find the projection of (1, 2) 
on (3,1), and sketch. (ᾳ) The force of gravity on a body resting on an in- 
clined plane is 15 lb. If the slope of the inclined plane is 4, find the force parallel 
to it. Make asketch. (1) Resolve the vector (3, 4) into components parallel 
and perpendicular to the vector (1, 1). Sketch. 
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PROBLEMS 


In Problems 1 through 4 find the angle from the line with the first slope to 
that with the second. 


14:5. a ee 8. 3; 2. a οἱ 


5. Find the line through the origin bisecting the angle made by the positive 
x-axis and the vector (1, 3). 
6. What can you say about the inclination of a line with negative slope? 


In Problems 7 through 10 find the slope of the line with the given inclination. 
7. 185“, 8. 3. 9. 1/2. 10. 1. 


In Problems 11 through 14 find the angle from the first to the second line: 


11. 85 —4y4+2 =0;47 = y. 
12. 2x - 4y = 3;2 = —1. 

13. y = —x#+2;a4+y = 5. 

14. 92+ Τὸν = 1;102-+ lly = 2. 


15. A gun changes its aim by a small angle and the point of impact shifts by 
100 yd. If the range of the gun is 5000 yd, what was the angle of shift? 

16. Show that the vector (A, B) is perpendicular to the line Ax + By + 
C = 0. 
*17. Find a formula for the tangent of the angle between Ax + By +C = 0 
and A’x-+ Bly +C’ = 0. 
%18. Write a necessary and sufficient condition that the lines in Problem 17 
be perpendicular. 
*19. Prove the following laws. 


(11) uev = veu, 
(12) u-(v-+w) = u-v+u-w, 
(13) u- (kv) = k(u-v). 


420. Laws (11), (12), and (13) are similar to certain laws for addition and 
multiplication of real numbers. Does the dot product of vectors satisfy other 
laws analogous to those for real numbers? For example, does (1-16-1) hold for 
the dot product of vectors? Does (1-16-5)? Does (2-8-5) hold for the dot 
product of vectors if the two zeros in the right member are displaced by 0? 
*%21. Derive (4-7-10) by noting that d is the length of the projection on a 
vector perpendicular to the line of any vector joining a point on the line to 
(x0, Yo). 

422. The work done by a force F of constant magnitude and direction moving 
a directed distance 5 in the direction of F is defined as [ΕἾ |s|. Suppose a force F 
is exerted on a body and causes it to move a directed distance s at an angle @ 
with F. Show that the work done is Fs. | 

* 23. Find the interior angles of the triangle whose vertices are (1, 0), (8, —4), 
(—5, 7). Check by sketching. 
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Q 


Figure 4-69 B 


*24. Show that (julv + |v|u)/(|u} + |v|) bisects the angle between ἃ and v. 
*25. Figure 4-69 is adapted from Sears and Zemansky, University Physics, 
p. 731. ZAQO and Z ABO are right angles. From v/v’ = n’/n show that 
nsing = η sing’ (Snell’s Law). 


ANSWERS TO EXERCISES 


(a) Any vector in the line has direction angles equal to the inclination @ or 
to 6-+ w; but, (4-12-21). (Ὁ) Note that both angles are from first to second 
line. (0) —26.4°. (d) my — mo = 0 Ὁ tan (d2 — $1) = 0 © φὸ = 
gdit+kr. (e) 1+ mime = 0 cot (2 — $1) = 0 φ. — φι = 
w/2--+ kr. (ἢ Use (4-12-9), (4-12-10). (g) C = 2/2 > cosC = 0. 

(h) 1.4, 38°, 117°; draw sketch to see which values to choose from among those 
satisfying (3). (Gi) C = 105°, a = 8.9, ὁ = 15. (k) ᾧ = r/2+kro 


1 Y2 
cos@ = 0 «Ὁ a1272+ yiy2 = 00 πὶ 72) = —|l, 


(1) 6 = 0+ kr (cos g| = 1 (azo + yry2)? = (2? + y?)(22 + BO 
vty} — Qxiyerey1 + αν = 0 (r1y2 — tey1) = Ὁ «Ὁ ye/xe = y1/21. 
(m) (41:2, ψιτὼ, τα, yoty) in (6). (n) See Exercise (k). (o) By subtract- 
ing projection from original. (p) (1.5, 0.5). (q) u = (—4.5, —1.5), |u| = 
15/10 = 48. (r) (3.5, 3.5), (—0.5, 0.5). 


ANSWERS TO PROBLEMS 


1. 34°. 3. —8°. δ. y = 0.72 approximately. 7. —1l. 9. Undefined. 11. 39°. 
13.0. 15. 6 = 24 mils. 17. (AB’ — A’B)/(AA’ + BB’). 


CHAPTER 5 
RELATIONS AND FUNCTIONS 


5-1 What is a relation? In the sentence “I have good relations with 
my relations,” the word “relations” is used in two quite different senses. 
In its first appearance it refers to the relationship that I have with cer- 
tain people. In the second it refers to these people. In mathematics, 
“relation” is used only in the first sense, 1.e., in the sense of relationship. 

A relationship, or as we shall say from now on, a relation, is something 
that holds or does not hold between two things. For example, if x and y 
are real numbers, the relation of greater than, symbolized by >, either 
holds between x and y (in which case we write x > y) or it does not hold. 
This suggests viewing a relation as a property of pairs. But we note that 
the order of the two things is important. For example, if x > y, then 
~(y > x). This suggests that we might view a relation between two 
things as a property of ordered pairs. 

As we pointed out in Section 3-10, there is a one-to-one correspondence 
between properties and sets. Hence, if we view a relation as a property 
of ordered pairs, this amounts to thinking of it as a set of ordered pairs. 
This is the point of view taken in modern mathematics. We think of a 
relation as a set of ordered pairs, namely the set of pairs whose first com- 
ponent stands in the given relation to its second component. For ex- 
ample, we view > as the set of couples (2, y) for which x > y. 

The purposes of this chapter are (1) to show how relations may be under- 
stood as sets of ordered pairs, (2) to introduce the concept of a function as 
a special kind of relation, and (3) to apply these ideas to algebra, analytic 
geometry, and simple scientific problems. 


5-2 The cartesian product. The carteszan product, A < B, of two sets 
A and B is defined as the set of all ordered pairs whose first components are 
members of A and whose second components are members of B. 


(1) Def. AXB= {(zy|tEAAYEB}. 


For example, if A = {a,b,c} and B = {d,e}, then A X B = j{(a,d), 
(α, 6), (ὃ, ἀ), (ὖ, 6), (c,d), (c,e)}. If A= {-1,0,13, Ax A= 
coat aA) [31 0), {-:.} 1), (0, =), (0, 0), (0, 1), (1, ΞΞΤῈ (, 0), ar 1)}. 
Find AX Bfor: (a) A = {1,2,3}, B = {4,5}; (b) A = B = (0,1, 2}. 
(c) How is the limitation that A and B are sets “built in” to the definition? 
(4) Argue for (z,y) E AX BercEeEAA YyEB. 
244 
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The cartesian product is a useful concept when we wish to talk about 
ordered pairs whose components are chosen from certain sets. For ex- 
ample, in plane analytic geometry we identified points with pairs of real 
numbers. Hence each point is identified with a member of Re X Re, where 
Re = the real numbers, and we may view the plane as a geometric picture 
of Re X Re. The cartesian product is named after Descartes, one of the 
inventors of analytic geometry, and a plane with axes is often called a 
cartesian plane. 

The fundamental idea of plane analytic geometry is that there is one 
and only one point corresponding to each member of Re Χ Re and one 
and only one member of Re X Re corresponding to each point. (See Sec- 
tion 4-1.) A similar conception can be applied to pairs that belong to 
cartesian products other than Re Χ Re. Suppose, for example, that we are 
interested only in the digits G, {0,1,2,...,9}. Then G Χ G consists of 
the 100 ordered pairs (points) whose components (coordinates) are digits. 
We are not even limited to numbers. For example, we may wish to vis- 
ualize the pairs chosen from 5 persons, a, b,c,d, and e. Letting S = 
{a, ὃ, c,d, e}, S X Sis a set of 25 ordered pairs. We may assign a point 
on the x-axis and on the y-axis to each member of S. Then each ordered 
pair is represented by a point in the lattice of 25 points. Suppose now we 
wish to graph the sentence “x dominates y,” knowing that each person 
dominates those and only those whose names precede his on the roster. 
The graph would appear as in Fig. 5-1 if we choose the points on the 
axes as Indicated. 


Figure 5-1 


Abbreviating “dominates” by “D,” graph x D y in the following cases. 
(e) Each person dominates those and only those whose names follow his on the 
roster. (ἢ) The only dominations aree Da,e Db, andeDd. (g) Only a 
dominates and he dominates all others. (h) Only a is dominated and he is 
dominated by all others. 
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Consider the sentence x = y. If the range of the variables is Re, its 
graph is the straight line bisecting the angles between the axes in the first 
and third quadrants. If the range of the variables is G, the graph is the 
set of ten points {(0, 0), (1,1),..., (9, 9)}. If the range of the variables 
is the set of five people considered above, then the graph 15 the set of 
ordered pairs {(a, a), (b, δ), (c, c), (ἃ, d), (e, e)}. Evidently the set defined 
by a sentence depends on the ranges of the variables. If no indication is 
given as to the ranges, the set {(z, y)|y = x} is the set of all ordered 
pairs of things about which we are talking and such that the two com- 
ponents are the same. 

To have a convenient way of indicating the ranges of the variables, we 
define the truth set of a sentence s(x, y) in A X B as the set of all solutions 
of s(x, y) that are members of A X B. This makes the range of x a subset 
of A and the range of y a subset of B. 


(2) Truth set s(x, y)in AX B= {(2,y)| s(t, y)} N(A X B). 


Then we can characterize the plane cartesian graph of a sentence as its 
graph in Re X Re. More generally, the graph of s(x, y) in A X B is the 
geometric picture of its truth set in A X B. 

In practice, when we wish to graph a sentence in the cartesian plane, 
Re X Re, we find a few points of its graph in some subset of Re X Re, 
most often J Χ Re, where J = the integers, and then link these points 


Y 


—3 -2 -- 
(0, 0) 


FIGURE 5-2 
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by a smooth curve or in some other way that appears plausible. Tor ex- 
ample, to graph y = x” in the cartesian plane we may locate all its points 
in {—3, —2, —1,0,1, 2,3} Re, as in Fig. 5-2, and then draw the indi- 
cated smooth curve. 


Graph the following in {—3, —2, —1,0,1, 2,3} X Re and then link the 
points with a smooth curve or shade a region to suggest the graph in Re X Re: 
(i) y = 23, (j) y = Wz (in this one include a few more integral values of 2), 
(k)ce>y (y=1/r, (My<i/e, (ὦ) y= “5-ΕἘ1. 


PROBLEMS 


In Problems 1 through 4 sketch cach cartesian product in the plane. 


1. {0,1} Χ {0, 1}. FPA ὃ TK C10, 1, 
3. {0,1} Χ {0, 1, 2}. 4. {0,1,2} X {0,1}. 


δ. Show that 4Χ B = BX Ais not a law. 
6. Under what conditions does 4 Χ B = BX <A? 
*7. Prove that 


(3) AK (BNC) = (ΙΧ BNC! X& C). 


*8. State and prove as many other laws as you can about the cartesian product. 
*9. Argue that the number of members of A X B is the product of the num- 
ber of members of A and the number of members of B. 


ANSWERS TO EXERCISES 


(a) {(1, 4), (1, 5), (2,4), (2, 5), (3,4), (8, 5)}. (Ὁ) {(0, 0), (0, 1), (0, 2), 
(1,0), (1,1), C1, 2), (2,0), (2,1), (2,2)}. (ὦ The right member makes no 
sense unless A and B are sets. (d) (3-3-3). (6) through (h) See Fig. 5-3. 
(1) {(—3, =27), {--2 ==8); (=A, =1); (0, 0), (1, 1), (2, 8), (3, 27)}. 4) See 
Fig. 5-4. Note that only one value of y corresponds to each value of z, and 
that there are no values corresponding to negative values of x. (k) All points 
below y = x on lines x = —3, 5 = —2, etc. (1) Sce Fig. 5-5. (τ) See 
Fig. 5-6. (n) Fig. 5-2 with graph shifted up onc unit. 


y y 


mao τὸ: 
aro go 
Qe ao Qs 
REwroanaes 


wc 


abede arr a ae abcde 
(f) (g) (h) 
Figure 5-3 
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(9, 3) 


FIGURE 5-4 


Figure 5-5 FIGURE 5-6 


ANSWERS TO PROBLEMS 


1. Four points. 3. Six points. 5. Choose A # B. 
7, AX(BNC) ={(z,y|ztE AA YEBNC = (, yY|x EAANYEBA 
yEC} ={@Wi/@EAANYVEB A @EAN YEO} = (αν σεαλ 
yEBIN{(zy[eEAAYEC} =(AXBN(AXOC). 


5-3 Binary relations. As indicated in Section 5-1, we call any set of 
ordered pairs a relation. More specifically, we call any subset of A x B 
a relation in A X B. Relations of this kind are often called binary to dis- 
tinguish them from relations involving three or more objects, but we 
usually omit this adjective. Any set of points in the plane is a geometric 
picture of a relation in Re Χ Re. We call a relation in A X A a relation on 
A. Since any sentence s(z, y) defines a subset of A X B, where A and B 
are the ranges of the variables in the sentence, any sentence in two variables 
defines a binary relation. 

Consider, for example, the sentence x < y. This defines the relation 
{(x, ν}[1 < y} on the real numbers. It seems natural to use < as a name 
of this relation, i.e., to write that < ={(x,y)|« < y}. Consistent with 
this we adopt the following definition. 
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(1) Def. p = i(x,y)|z ρ y}. 


This is really a definition form, since we may substitute for p (the Greek 
rho, pronounced like “roe”) any constant such that x p y is a sentence. 
Of course other substitutions would yield nonsense on the right, and 
hence (1) does not mean that anything is equal to a set of pairs. 


(a) List those symbols introduced so far that are significant substitutes for 
pin (1). (b) Describe the relation named by each symbol in Exercise (a). 
(c) Suggest additional significant substitutes for p in x p y. 


According to (1) and (4-2-2), 
(2) (a,b)Eepoapb. 


We say that a is in the relation p to ὃ (a p ὃ) if and only if the pair (a, ὃ) 
belongs to the relation p. For example, [(3, 2) Ε >] < [8 > 21. 


Use € to write a sentence equivalent to each of the following: (d) 3 < 4, 
(0) 222, (ff) » -5 4, (8) x//y. 
Even if a sentence is not in the form  ρ y, it defines a binary relation 


to which we can assign a name. For example, consider the sentence “y 
is a multiple of x.” Using the abbreviation «|y, we give the precise 


definition 
(3) Def. aly=[teJ AYES A 3 σευ Ay = 21]. 


Then x|y < (2, y) Εἰ. We may read 2x[y also as “x divides y,” “x is a 
factor of y,” “y is divisible by x,” or “x is a divisor of y.” 

The graph of a relation p in A Χ B is just the graph of the sentence 
xpyin AX B. For example, the graph of > in Re X Re is the set of all 
points satisfying + > y. On the other hand, the graph of > nG@xXG 


is the set of points sketched in Fig. 5-7. 


Y 


(1, 0) (9, 0) 
Figure 5-7 
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Graph the following relationsinG X G: (h) S, (i) <, @) Ὁ, (ἡ =, 
(1) | (divides). 


For a given value of 2, the sentence x p y may have one, several, or no 
solutions for y. For example, let Br be a relation on people defined by 


(4) x Bry Ξξα 15 a brother of z. 


Then x Br y has several, one, or no solutions for y, depending on whether 
x has several, one, or no brothers. We call any value of y such that x p y 
an tmage of x under p. 


(5) Images of x under p =a {y|x p y}. 


We see that for a given relation a value of x determines a set of images 
(which may be the null set). 


Describe the images of 4 under each of the following relations: (m) =, 
(n) >, (0) the relation defined by y? = xz, (p) p defined by Fig. 5-8. 


The set of all objects that have at least one image under a relation p 
(i.e., whose set of images is not empty) is called the domain of p. The set 
of all objects that are images under p of some object is called the range of 
p. If p= {(2,y)|z p y}, the domain consists of values of “x” and the 
range consists of values of “y.” The domain consists of first components 
of pairs belonging to p and the range consists of second components. 


(6) Def. Dom (p) = [2 |3ν x p y}, 


(7) Def. Rge (p) = {y|sz x p y}. 


Figure 5-8 
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Let p be defined by a sentence s(x, y) so that 
siz, y) A Zpy (x,y) Ep. 


Then the domain of p is a subset of the range of significance of x in s(x, y), 
and the range of p is a subset of the range of significance of y in s(a, y). 


(q) For p shown in Fig. 5-8 what are the domain and range? Find the images 
of each member of the domain. (1) Answer the same questions for o (sigma) 
in Fig. 5-8. (8) Find the domain and range of Br defined by (4). (t) Do 
the same for the relation in (1). (ἃ) Do the same for the relation in Exercise 
(0). 

Familial relations are useful as illustrations of concepts discussed in 
this and following sections. Accordingly we add to the list begun with 
(4). The range of significance of the variables is understood to be persons 
born since 1800 A.D. 


(8) xSiy =ayisa sister of x. 

(9) xSoy =ayis a son of x. 

(10) x Day =ay is a daughter of x. 
(11) xPay =ayis a parent of z. 
(12) xFay =gy is the father of x. 
(13) x Moy =a y is the mother of x. 
(14) x Huy =qay is a husband of z. 
(15) x Wiy =ay isa wife of x. 

(16) xChy =qayisa child of z. 
(17) xSby =ay isa sibling of x. 


(v) Define each relation in (8) through (17) as a set of pairs. (w) Specify 
the domain and range of each. (x) For each relation describe the images of 
an element of the domain. (y) Why did we say “the” in (12) and (13), but 
“a” elsewhere? (2) Define each relation by an idiomatic sentence whose 


subject is x instead of y. 
PROBLEMS 


Suppose we have two propositions called 0 and J, the first false and the 
second true. In Problems 1 through 6 graph the relations in {0,1} Χ {0, 1}. 
Give their domains and ranges and find images of 0 and 1 under each one. 


lA. 2, V. = 4, ©, 5. VV. 
*6. / (See Problem 23 in Section 2-5). 
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*7. What other relations are possible in {0,1} X {0,1}? Treat each one as 
you did those in Problems 1 through 6. Suggest meanings in terms of logic. 


In Problems 8 through 17 graph the relations in A X Re, where A = {—9,. 
—8, —7, —6,...,—1,0,1, 2,..., 9}. 

8. I defined by 1 = {(z,y)|y = x}. 

9, I? defined by I? = {(2, y)|y = x7}. 

10. {@ yly =a2+t1V y = 1.- 2). 


11, @, 4) |e y = 8). 
12. {(z, ψ) 3 + y? — 24y — 5 = 0). 
13. [(ὦ, y)|y = lal}. 
14. {(2, y)|a = [0]}. 
Ww15. {(2, y)|y = sin 2}. 


16. {(z,y)|y = tanz}. 
*17. {(2,y)ly = cos x}. 


18. Describe the domain and range of each relation listed under Exercise (a). 


ANSWERS TO EXERCISES 


(a) - Λ, Μ, το ὑπο Μ, ΞΘςςξ, //; % <, >; «, >, A. (b) For example, 
= is the set of ordered pairs whose members are identical. (0) Any verb, 
e.g. “loves,” since “zx loves y” defines the relation of loving. (4) (3,4) © <. 
(6) 226E€>. (ὃ ῳ,4., ΞΕ -ο.«. (g) @y) Ε “]. (h) All grid points not 
in Fig. 5-7. (4) Like (8) but less points on diagonal. (7) Fig. 5-7 plus 
points on diagonal. (k) Ten points on diagonal. (1) See Fig. 5-9. (m) {4}. 
(n) {yly > 4}. (0) {2, —2}. (p) The empty set. (α) Dom (p) = 
(1, 2, 3, ὃ, θ), Rge (p) τ ι0, 1, 2, 3, ὅ, θ). (r) Dom (σ) ἘΠῚ 0, 1, 2, 3, 4, 6}, 
Rge (0) = {0, 2, 4,6). (8) The domain is the set of all men with brothers, 
the range 15 the set of all males with siblings. (t) Domain is the digits 1 
through 9, range is G. (u) Domain is non-negative reals, range is reals. 


me wow fF oO DO N ὦ © 


~_ 
— 


0 1 2 38 4 ὃ 6 7 8 9 


FIGURE 5-9 


5-4] FUNCTIONS 259 


ΠῚ y y | 
] e e Φ 
Λ V Ex 
ΜΗ x xv 
1 
y y y 
Φ 
as Vv i 
x x x 


Figure 5-10 


(v) For example, Si is the set of ordered pairs whose second components are 
sisters of their first components. (w) Rye (Da) = women, Dom (Da) = men 
and women who have at least one daughter. Rge (Pa) = parents, Dom (Pa) 
= people. Rge (Fa) = fathers, Dom (Fa) = people. Rge (Mo) = mothers, 
Dom (Mo) = people, Rge (Hu) = husbands, Dom (Hu) = wives. Rge (Wi) = 
wives, Dom (Wi) = husbands. (x) For example, the images of x under So are 
the sons of x. (y) “The” applies only where there is uniqueness. (z) For ex- 
ample, + Pay = xisachild of y; andzSo y = zis a parent of y and y is male. 


ANSWERS TO PROBLEMS 


1 through 6. See Fig. 5-10. 7. Ten others, including the empty relation (no 
points) and the universal relation (all four points). 9. Nineteen points. 11. 
Nineteen points. 13. Nineteen points arranged in a “V.” 15. Remember, z is 
In radians! 


5-4 Functions. We have seen that an object in the domain of a relation 
may have one or several images in the range. For example, in a monoga- 
mous society a person has at most one wife, so that each object in the do- 
main of Wi has one and only one image under Wi. On the other hand, a 
person in the domain of Br (i.e., a person who has at least one brother) 
may have several images (brothers) under the relation Br. As another 
example, consider the relation defined by y = x? with domain the real 
numbers and range the non-negative real numbers. Each number in the 
domain has just one image, its unique square. On the other hand, consider 
the relation defined by x = y? with domain the non-negative real num- 
bers and range the real numbers. For each nonzero value of x there are 
here two values of y that satisfy the equation, so that each object (except 
zero) in the domain of this relation has two images in the range. Finally, 
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under the relation < each real number has an infinite number of images, 
since for any x, the sentence x < y has infinitely many solutions for y. 

Relations such that every object in the domain has just one image are 
called functions. This means that a relation p is a function if and only if 
for any x in Dom (p), the set of images of x under p is a singleton. Letting 
“Fun” stand for the class of functions, we may write formally 


(1) Def. p © Fun = γα [x € Dom (p) > Jly z p ψ]. 


Since each object in the domain of a function has just one image, this 
unique image is determined as soon as the object in the domain is specified. 
Obviously, functional relations are of great importance whenever we have 
a situation in which one thing determines another. Indeed, historically 
the function concept preceded the more general concept of relation, and 
the idea of determination usually dominates the idea of relationship. 

Since each object in the domain of a function F determines a unique 
image, and since we often wish to speak of the image of a particular object 
or of an unspecified object, it is convenient to have a special symbol for 
the image of the object x under the function F. This symbol is provided 
by the functional notation, which we have been using since Chapter 2. 
Formally, we may introduce this usage as follows. 


(2) Def. F(x) = yo a F y. 


Note that the right member of the definition makes no sense unless ἢ 
is a function. It may be read “the y such that x F y,” “the y that is an 
image of x under F’,” or “the image of x under F’.” This is a definition form 
that gives meaning to “F(x)” whenever F is a function and x € Dom (F). 
We have immediately 


(3) [ly = F(x)] @ [x F yl, 

(4) F= {(a,y)ly = F(a)}, 

(5) a F F(z), 

(6) | (x, F(x)) € F, 

(7) F = {(x, F(x))|x © Dom (F)}. 


Only functions are significant values of F in these laws. This is suggested 
by using the variable “Ε΄, although any letter may be used. 

We can now give meaning to the letter “f” in the functional notation 
“f(x)” introduced in Section 2-9. If “f(x)” is a formula, “f(~)” has a unique 
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value corresponding to each value of “x,” by our definition of the word 
“formula.” Hence the sentence “y = f(x)” defines a function whose domain 
is the range of significance of “x” in “f(x)” and whose range is the range 
of “f(z).” Then by (4), “f” is a name of this function. Hence we were 
speaking quite precisely in Section 2-9 when we said that “the reader may 
think of it as standing for the relation between the values of x and the 
corresponding values of the formula.” 


Which of the following relations are functions? (a) Mo, (0) Da, (c) Hu, 
(d) Pa, (6) pin Fig. 5-8, (f) σ in Fig. 5-8. 
Read in words: (g) Mo (4), (8) Fa (John), (4) o(8) in Fig. 5-8. 


We may visualize a function in a great variety of ways. All these go 
back to the fundamental fact that a function is a set of ordered pairs in 
which no two different pairs have the same first component. This is obvious 
if we recall the definition of a relation as a set of pairs and observe that 


in (1), [Alyx py] [Aly (a, y) € pl. 


Which of the following are functions? (7) {(0, 1), (1, 1)}; = ¢k) {(0, 0), 
(0, 1)(4, 0}}; () {(0, 1), G, 1), (2, 1}. 


Evidently a function can be specified by giving the roster of its members, 
provided there are only a few members. A more general way to specify 
a function is to give a defining sentence that defines the set of ordered pairs. 
(Compare Section 3-2.) This may be done explicitly by a sentence of the 
form y = f(x) or implicitly by a sentence s(x, y) of some other form. For 
example, y = x? and x? — y = Ὁ define the same function. The first 
gives an explicit formula for the image of each x. The second gives the 
image of each x implicitly, since it has to be solved for y in order to find 
the image in each case. When a function is defined by a sentence of the 
form y = f(x), “f” is a name of the function as indicated by (4). If 


Va | [3y s(z, y)] - [ly s(z, y))], 


then {(z, y)|s(x, y)} is a function. 


In each of the following, solve for y so as to get a sentence defining the func- 
tion explicitly: (m) y—zx=0,and (n)y — 21: -Ε 1 = 0. 


A third way to specify a function is to give a formula for the image of 
any object in the domain. Thus we could define the function discussed 
above by writing f(x) = x”. Of course, it may not be convenient to 
write a single formula that gives the image of every object in the domain. 
All that is necessary is that some rule be given from which we can calculate 
the image of any object. For example, we might write f(z) = x? for 
« > Oand f(z) = rforr < 0. Thenf = {«,ψὺ'’), > OA y = 2?) Vv 
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( < 0A y= 2)}. We call f(x) a defining formula of f. When a defining 
formula f(z) is known, the function may be specified as in (7) without 
using a second variable. . 


(0) Write a defining formula for the function defined by the sentence y? = 
zA\ y= 0. (Ὁ) Do the same for “x was sired by y.” 


Since functions are relations of a special type, the discussion of the 
previous section applies. For example, the domain of a function defined 
by y = f(x) is the set of values of x for which f(x) exists, and the range is 
the set of values of y for which y = f(x) for some x. We have discussed 
functions by using x to stand for objects in the domain and y to stand for 
those in the range. Of course, other letters could be used. In any case, the 
variable used to stand for objects in the domain of a function is called the 
independent variable, and the variable used to stand for objects in the 
range is called the dependent variable. The terminology refers to the fact 
that the value of the dependent variable is determined by that of the 
independent variable. 


Consider the function f defined by “f = {(z, y)|a has the army serial number 
y}.”  (q) What is the domain? (Ὁ) What is the range? (s) Write an ex- 
plicit formula for f(z). (t) Which is the dependent and which the independent 
variable? 


It is often convenient to visualize a function as the set of points cor- 
responding to the pairs that make up the function. The graph of a function 
f is simply the graph of the sentence y = f(x). Graphs of functions have 
one special feature not possessed by those of other relations: no vertical 
line crosses the graph of a function in more than one point. This is the 
geometrical counterpart of the fact that only one value of the dependent 
variable corresponds to each value of the independent variable. For ex- 
ample, Figs. 5-2 and 5-4 are graphs of functions, whereas Figs. 5-1 and 
5-6 are not. 


-(u) Some relations in Re X Re are sketched in Fig. 5-11. Which are 
functions? Write a defining sentence for each. Indicate the range and domain. 


PROBLEMS 


1. If F = {(1, 0), (—3, 4), (0, 0), (2, 0), (8, —4)}, specify Dom (F), Rge (F), 
F(—38), F(0), F(2). Sketch the function. 

2. Do the same for G = {(—2, 4), (—1, 1), (0, 0), (1, 1), (2, 4)}. Suggest 
a defining formula and define G by using it. 

3. Let 1? = {(z,y)|y = 2? A «x EJ}, where J = the integers. Specify: 
(a) 17(1), 13(2), Dom (I?), Rge (12); (b) three names for the function; (0) a 
defining formula for the function; (d) a defining sentence for the function. (e) 
What is the image of m under 12) (Ὁ What two numbers have the image 16 
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(a, ὃ) 


Y Y 


(2, 1) 


Figure 5-11 


under 1229 (g) If the independent variable has value 3, what is the corre- 
sponding value of the dependent variable? (h) Sketch 12. 
4. Graph the following sentences. Which define functions? x — y = 7; 
στε γῆ τπ 4; y=rAnx «0; μ{0] =z. 
5. Is it legitimate to write “P(x) = the point on the axis corresponding to 
the real number z,” and to consider P a function? Explain. 
6. Does “z is a name of y” define a set of pairs (x, y) that is a function? 
Explain. 
7. Can any expression serve as a defining formula for a function? Explain. 
8. What is the relation between the range of a formula involving just one 
variable and the range of the function it defines? between the range of the 
variable in such a formula and the domain of the function it defines? 
9. If a function is defined by a propositional formula, what is the nature 
of its range? its domain? 
10. If T(x) = the truth value of x, what is the domain and range of 7’? 
11. If f(x) = the digit in the twenty-fifth decimal place of the decimal ex- 
pansion of z, is f a function? What is its domain and range? 
12. What are the domain and range of the function defined by f(z) = ~2z? 
Describe the image of x under f. 
*13. Let T(f) = the truth table of f. Is T a function? What is its range and 
domain? 
Ἃ14. If T(z) = the truth value of z, what can we say about the range of the 
function defined by T(f(zx)) if f(x) is a law? 
15. Does “y dislikes x” define a function? Explain. 
16. Does “y is married to x”? Explain. 
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* 17. Mention some functions whose domains consist of sets. Give a defining 
formula for each and specify the domain and range. 

* 18. Specify a function whose domain is Re, the real numbers, and whose range 
consists of intervals. 

19. Mention a function whose domain is a set of intervals of real numbers 
and whose range is a set of real numbers. 

20. List several functions whose domain consists of ordered pairs of real 
numbers. 

21. Do the same for the domain consisting of ordered pairs of sets and the 
range consisting of sets. 

22. What is the domain of the function defined by y = max (2) ἢ 
_% 23. In the theory of games a strategy is defined as a rule for choosing a move 
in any situation that may arise in play. Describe a strategy as a function, 
indicating its domain and range. . 

24. Which of the relations in Fig. 5-10 are functions? 

25. Is any straight line the graph of a function? 

26. Is Fig. 4-12 the graph of a function? 

27. Comment on the following. “Let @(u) be the direction angle of u.” 

28. What is the domain of the function defined “by y = the x-intercept of 
the straight line L,” where L is the independent variable and y the dependent 
variable? 

29. Does y = sin z define a function? What are its domain and range? 

30. What are the domain and range of cosine? tangent? 

*%31. Let Qp(x) = the images of x under p. (a) What are the domain and 
range of Qe? (Ὁ) Under what conditions does the range consist of singletons? 
(c) What is the union of all the members of the range of Qp? (d) If p is a func- 
tion, is the range of Qp the same as the range of p? | 

«32. Reformulate Problems 17 through 21, using the idea of cartesian product. 


ANSWERS TO EXERCISES 


(a), (c), and (f) are functions. (9) The mother of x. (8) The father of 
John. (i) Four. (j) and (l) arefunctions. (m) y= 2 (ἃ) y = 2. —1. 
(0) y = Vz. (p) y = Fa(z). (α) Soldiers having serial numbers. (r) A 
set of serial numbers. (s) y = the serial number of z. (0) z independent, 
y dependent. (u) a, B, σ, 6. Dom (8) = (0 ὁ), Rge (6) = (ἀ ὁ). β = 
{ (x, y)| (x, y) lies on the graph in Fig. 5-11.} 


ANSWERS TO PROBLEMS 


1. Dom = (1, —3,0, 2,3}, Rge = {0,4,—4}, F(—3) = 4, FQ) 0, 
F(2) = 0. Graph consists of five points. 3. (a) J2(1) = 1, 13) = 4, 
Dom (12) = J, Ree (12) = perfect squares; (Ὁ) “I?,” “[(, y)|y = 227A xeJ},” 
“F(a, “23] 5 Ε JH”; (ὁ “x2”; (ἃ “y = a? A x EI”; (0) πῆ; (Ὁ 4 and —4,; 
(6) 9. 5. Yes, if an axis is assumed to have been specified, since just one point 
corresponds to each number. 7. No, the expression must be a formula, since 
it must be such as to have a unique value for each value of its variable. 8. Same 
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in both cases. 9. Range consists of propositions, domain is not limited by this 
information. 11. Yes; domain is real numbers, range is the digits. 13. Yes; 
domain is the set of logical formulas, range is the set of corresponding truth 
tables. 15. No, since often more than one person dislikes a given individual. 

17. F(x) = x’; domain the power set of some set, range the corresponding 
complements (range = domain in this case). 19. F(x) = the length of z; 
domain is intervals, range is non-negative reals. 21. F(z, y) = «NM y. 23. The 
domain is the set of possible situations; the range the corresponding moves 
specified. 25. No, not vertical lines. 27. Nonsense, since a vector has many 
direction angles. 29. Yes, domain is real numbers, range is (—1_1). 31. (a) 
Domain is the domain of p, range consists of sets of images. (b) When p is a 
function. (c) The range of p. (d) No, if p is a function the range of Qp is the 
set of all singleton subsets of the range of p. 


*5-5 Aspects of the function idea. The idea of one thing determining 
another is very old and very familiar. It is natural that this idea has been 
described in many different ways and appears under a variety of aliases 
in different fields. In Section 5-4 we have defined a function as a special 
kind of relation, that 15, as a set of ordered pairs in which each first com- 
ponent is paired with only one second component. The cartesian graph 
of a function is a set of points that is met by any vertical line in at most 
one point. The function may be defined by listing the pairs, by giving a 
defining sentence or defining formula, or by the graph. In this section we 
discuss various other ways in which functions appear and terms in which 
they are described. 

Tables. A function may be defined by a table that lists in some con- 
venient fashion the images of members of the domain. Thus, the first 
two columns in Table I in the Appendix define the square-root function 
with domain the first 100 positive integers. Or we may think of Table I 
as giving merely a partial roster of the pairs belonging to {(z, y)|y = “τὶ 
with domain the real numbers. Such partial tables can be used to find 
other images exactly or approximately by various devices. For example, 
we can find 2600 from Table I by noting that +/2600 = 10\/26, and 
we can estimate +/26.3 by linear interpolation as explained in Section 
4-11, 

(a) What function is defined by the first and third columns in Table I? 
*(b) Use Table I to find 86,000 and to estimate 51.6. 


Although we can tabulate only a finite number of pairs belonging to a 
function, we can think of a function as specified by an imaginary table 
giving all pairs. Each pair corresponds to a point on the graph of the 
function. Printed tables are then viewed as a selection, often (but not 
always) a selection of pairs with equally spaced values of the independent 
variable. 
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(c) What would be the appearance of a table of a nonfunctional relation? 
Have you ever seen such a table? 


Mappings. When we construct a map of a portion of the earth’s sur- 
face, each feature on the ground 15 represented by a corresponding symbol 
on the map. Letting f(z) = the point on the map corresponding to the 
point x on the ground, f is a function whose domain is the mapped portion 
of the earth and whose range is the map. By analogy we describe any 
function as a mapping that maps its domain onto its range. Schematically, 
as suggested in Fig. 5-12, we may think of the domain and range of a 
function as sets of points, and of the function as mapping each point in 
the domain into its image in the range. 


(d) Is it possible for a function to map two different points in its domain 
into the same point in its range? (6) Sketch {(z, y)|y = 22 A x © {—2, —1, 
0, 1, 2}} by representing the domain and range on two parallel.axes and draw- 
ing an arrow from each object in the domain to its image in the range. 


Correspondences. We have spoken of the image of an object in the do- 
main as the object that corresponds to it. With this in mind, relations are 
often described as correspondences. If the relation is a function, each ob- 
ject in the domain corresponds to one and only one object in the range, 
though more than one object in the domain may correspond to the same 
object in the range. Hence relations are called many-many correspondences, 
and functions are called many-one correspondences. 


(f) In the function defined by y = x”, how many objects in the domain 
correspond to each object in the range? (g) Answer the same question for 
the relation Pa defined by (5-3-11). 


Operations. When we find the image of x under the function ἢ, we per- 
form an operation on x. A function f is often described as an operation, 
namely, the operation that yields (5) when it is applied to x. For ex- 
ample, the operation of squaring yields x” when it is applied tox. We may 
think of f as a kind of machine that produces f(x) when wz 1s fed into it, as 
suggested in Fig. 5-13. This way of thinking is not usually applied to 


v 


F(x) 
FIGURE 5-12 Figure 5-13 
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nonfunctional relations, but it could be by thinking of the output of the 
machine as the set of images of x under the relation. 


(h) What is the operation defined by y = x? — 3? 


When the domain of a function consists of ordered pairs from some set A, 
and its range of members of A, the function is often called a binary opera- 
tion on A. For example, addition is a binary operation on numbers. When 
the domain of an operation on A consists of A Χ A, that is, when every 
ordered pair of elements in A yields a member of A under the operation, 
the set A is said to be closed under the operation. For example, the real 
numbers are closed under addition, but not under division, since division 
by zero is undefined. Sometimes a function Is called a unary operatzon on 
the members of its domain. , 

Transformations. Sometimes a function is described as a transformation 
that transforms each object in its domain into the corresponding object 
in its range. This can be visualized in terms of Fig. 5-12 or Fig. 5-13. 
When this terminology is used, the image of x is sometimes called the 
transform of x. 


(i) What is the transform of x under Fa? 


Traditional terminology. The above terms are really metaphorical ways 
of talking about functions. They are suggestive of the nature of functions, 
facilitate thinking about them, and provide alternative expressions for 
ideas that appear very frequently. They are picturesque ways of talking 
about the set of ordered pairs that 7s the function. However, there are 
many other common ways of talking about functions that are quite con- 
fusing, even though hallowed by long and continued usage. 

Frequently the symbols used to define a function and to represent 
members of its domain and range are used as names of the function itself. 
Below is a table of some such usages, with the meaning of each. 


Common usage Meaning 


The function f(z). The function defined by “y = f(zx).” 
The value of the function f(x) 


corresponding to x. 
The value of the function at z. f(x). 
y is a function of z. ify = f(z). 
f(x) is a single-valued function. f is a function. 
f(x) is a multi-valued function. f is a nonfunctional relation. 


The expressions in the left column must be viewed as idioms. Taken 
literally, they are nonsense. For example, in the fourth entry y is a vari- 
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able, not a name of a function (a set of ordered pairs). Very frequently 
in science one sees expressions such as, “The distance is a function of the 
time.” This means that the time determines the distance, i.e., that there 
is a function f such that y = f(x) gives the distance corresponding to the 
time x. It should be noted that when we say “determine” we are not as- 
suming causality, but merely a correspondence (or pairing) of times and 
distances. The function idea does not imply causality. A function is a 
pairing, not an explanation of the pairing. We are perfectly free to pair 
members of any two sets. 

The reader will not become confused with the variety of terms if he 
keeps in mind the essential features of a function as a set of ordered pairs 
m1 which no two second elements are paired with the same first element. 
Only when it comes to talking about functions in various contexts does 
the profusion of terms appear. We usually have an independent variable 
to refer to members of the domain of the relation, a dependent variable to 
refer to members of the range, and a formula, rule, sentence, table, or 
other means of specifying the second component corresponding to each 
first component. The function itself may be called a mapping, corre- 
spondence, operation, or transformation. The values of the variables 
may be referred to as points, objects, or components. The object in the 
range corresponding to a member of the domain may be called the image, 
the corresponding element, the transform, or the value of the function. 
Some of the terminology may suggest that the formula or rule zs the func- 
tion, but this is merely a way of speaking. 


What is meant by the following? (j) “The function «ἢ.  (k) “The work 
done in moving a given distance is a function of the force applied.” (1) “The 
value of the function o defined by Fig. 5-8 is 6 when x = 4.” 


PROBLEMS 


1. Discuss photography in terms of the function idea. 

2. Is a one-to-one correspondence a function? 

3. Suppose we view a function in Re X Re as mapping the x-axis into the 
y-axis. If we are given the graph of the function, show how to determine the 
image on the y-axis of any point on the z-axis. 


Graph the relations in Problems 4 through 12 by drawing parallel axes and 
using arrows as in Fig. 5-16. In the case of more than a finite number of pairs, 
give a few typical arrows. 


4. p of Fig. 5-8. 5. σ of Fig. 5-8. 

6. The relation of Fig. 5-9. 7. The relations of Fig. 5-10. 
8. {(z,y)|y = «+ 3}. 9. {(z,y)|y = 20}. 

10. {(z,y)|y = —2}. 11. {(z,y)|y. = 2}. 


1 — x}. 


Il 


12. {(a,y)ly 
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13. Describe the relations in Problems 4 through 12 as mappings. 

14. Describe them as correspondences. 

15. Describe them as operations. 

16. Describe them as transformations. 
#17. “One quantity, or measurable thing y, is a function of another measurable 
thing x, if any change in z will produce or ‘determine’ a definite corresponding 
change in y....” (The Alphabet of Economic Science, by Philip H. Wicksteed) 
Discuss. 
*18. “Since the statement y» = f(x) implies a definite relation between the 
changes in y and the changes in 2, it follows that a change in y will determine a 
corresponding change in x as well as vice versa. Hence if y is a function of z, 
it follows that x is also a function of y.” (Same reference) Discuss. 

Ba Figure 5-14 graphs the function f. Sketch y = —/f(x), y = f(—2), 

= |f()|, y = f(z). 

” 90. Do the same for g, defined by Fig. 5-15. 
*21. Discuss the function partially tabulated in the first and fourth columns 
of Table I. Give a precise definition of the function. What are its domain 
and range? Find the image of 8.7 and of 87.2 under this function. Use the 
table to find 1/35. 


ANSWERS TO EXERCISES 


(a) Cube root with domain the first 100 positive integers. (b) 44.14; 3.73. 
(c) More than one entry in the body of the table for some entry in the margin. 
(d) Yes. (6) See Fig. 5-16. (ἢ) Two, except that only one corresponds to 0. 


(=; 0) 


(2, 0) 


Figure 5-14 Figure 5-15 


—2 —I1 0 l 2 
Fiegurn 5-16 
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y = f(—2) 


x 
(2, 0) (—2, 0) 


y = |f(x)j y = f(x) 
FIGURE 5-17 


(g) For any ἡ, as many as the children οὖν. (h) Squaring z, then subtracting 
3 from the result. (i) The father of x. (j) {(z,y)|y = 23}. (k) Thereisa 
function whose members are ordered pairs whose first components are forces 
and whose second components are the corresponding work done by them. (I) 
o(6) = 4. 


ANSWERS TO PROBLEMS 


1. The image of an object is the mark on the print corresponding to the 
object photographed. 3. From (z, 0) on the z-axis go vertically to the graph, 
then go horizontally to (0, f(z)) on the y-axis. 13. For example, the function 
in Problem 8 maps each real number into one three greater; it shifts all points 
three units to the right. 15. The same function is the operation of adding 
three. 19. See Fig. 5-17. 21. Domain and range consist of all nonzero reals. 
1000/8.7 = 114.9; 1000/87.2 = 11.46; 1/35 = 0.02857. 


*5~-6 Set algebra applied to relations. Since relations are sets of ordered 
pairs, all the concepts and laws of Chapter 3 are applicable. We may con- 
sider a cartesian product A X B and its power set, that is, the set of all 
relations in A X 3. Among these is the null relation (the relation that 
holds between no two objects chosen from A and B) and the universal 
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relation, A X B itself, which holds between the members of any ordered 
pair in A Χ B. Then the definitions and laws of Chapter 3 apply. 

In particular, if p is a relation, we call p’ the complementary relation. 
We have 


(1) ρ΄ = {(z,y)|~@py)}, 
(2) ( ρ΄ y) > ~( py). 


Often the complementary relation is indicated by drawing a diagonal line 
through a name for the relation. For example, we write τέ for =‘ and 
¢ for €’. 


(a) Read in idiomatic English: 3 <’ 8, α Ε΄ B, SG’ T,a//’b. (Ὁ) Iden- 
tify the graph of the complement of — in your solution of Problem 3 in Section 
5-3. (c) Does —’ have another name? (d) Describe So’ where So is defined 
by (5-3-9). What is its domain? range? | 


The union of two relations p and @ is the relation p U σ that holds when- 
ever either or both hold; that is, 
(3) [τ (ρ Uo)y]) > [zx py V xo yl. 


The intersection of two relations is that which holds for an ordered pair 
whenever they both hold; that is, 


(4) [x(pno)ylolr py Axa yl. 


One relation p is a subrelation of a second σ᾽ if and only if the second holds 
for any ordered pair for which the first does; that is, 


(5) (ρ Ξ- σὴ «» (Vayrpyoxay). 
Two relations are disjoint if and only if they cannot hold for the same 


ordered pair. 


(e) True or false? >C>; < C=; =C<._ (ὃ Explain in words the 
meaning of C, //, {, and U for relations. (g) Under what conditions is p = σ 
where p and σ᾽ are relations? (h) Give examples of disjoint relations. 


Consider the relation S defined by y”? = zx. It is not a function, since 
there are two images of each value of x unless x = 0. However, since 


2 


y= 2r)e(y= νὰν y= --ν), 
δ τε {@,yly = Vz} ὦ {@,y) ly = -- νϊ. 
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y 


(4, ---2) 1112 U pV2 
y= Ve yr =x 
Figure 5-18 


Hence S is the union of two functions. In Fig. 5-18 we have sketched 
y = V2, y = -- να, and y* = z. 


(i) Show that {(z, y)|a?+ y? = 4} = ἰ(, ν)ῖὴν = V4 — 27} U 
{(v,y)|y = —V4 — 2x7}. Sketch each function and their union as in Fig. 
5-18. (7) Express the relation of Fig. 5~9 as a union of functions. (k) Argue 
that any relation is the union of functions. (1) When we wish to determine 
the relation AM B, where A is defined by s(x, y) and B by r(a, y), we refer 
to this as ______ the ________ sentences s and r. (Fill in the blanks.) 


PROBLEMS 


Complete the sentences in Problems 1 through 12. 


1. Fa U Mo = ? 2. Hu U Wi = ? 3. Bro = 2? 

4,.(<U=) =? 5. Hun Wi = ? 6. Fa NM Pa = ἢ 
7. BruUSi = ἢ Ὁ: τἀ ee? 9. SoU Da = ? 
ΠΥ St 11. ("ὦ 69) =? AUR = 7 


13. Graph x? + y? = 9 as the union of two functions. 

14. Show that any circle is the graph of the union of two functions by showing 
that the equation of any circle is logically equivalent to the disjunction of two 
sentences defining functions. 


15. Do the same for (x — 3)? + (y+ 2)? = 25. 
Graph 16 through 20 as the unions of functions. 
16. ly] = x 17. ν' = 5. 18. [ν — 3] = 2Qz. 
19, y? = 2. 20. (ψ — 3)(y — x) = 0. 
21. Prove {(2, y)|f(a, y)g(a, y) = 0} 
= {(z, y) f(z, y) = ΟΣ U {(z, ψ))σία, y) = 0}. 
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Graph 22 through 25. 


22. xy = 0. 23. (4 --- y(«+y) = 
24. (1 -ἰ 2y — 1)8x-+y) = 0. = 25. (x? — y?)(y + 2) 


26. Prove {(z, y)|[f(a, y)]? + [g(a, yl? = 0} 
= {(z, y) f(z, y) = 0} ἢ {(z, yl g(a, y) = O}. 
Graph 27 through 30. 
27. 27+ y? = 0. 
28. («x — γ)5 + (a+ y)? = 0. 
29. (x + ὃν — 1)?+ (324+ y)? = 0. 
30. (2? — y?)? + (y+ 2)? = 0. 


31. Consider the relations =, #, <, >, <, and > on the real numbers. 
Write as many true statements as you can about them, using the ideas of the 
set operations and relations. For example, # = =’, <C<,and<N2>e=.... 


% 32. Do the same for the binary relations on sets, GC, C, %, //. 


*33. Do the same for the logical relations, V, A, -, «Ὁ», V. 
%34. Do the same for the familial relations. 


0. 
= 0, 


ANSWERS TO EXERCISES 


(a) 3 is not less than 3, a is not a member of B, S is not a subset of TJ, a and ὃ 
are not disjoint. (Ὁ) {(1,0)}. (ὁ No.  (d) Set of all ordered pairs of 
people in which the second component is not a son of the first; range = 
domain = set of all people. (e) T, F,T. (f) For example, p Ca if and only 
if whenever x p y then x a y but there is an (z, y) such that zo y but 2 p’ y. 
(g) zpytauay. (ἢ) <, >; 4, o for any a; etc. (i) Use (2-8-21) 
(x:y, a:1, b:0, c:z? — 4) or subtract x22 from both members and use a? = 
boa =bVa=—b. (j) y = 02) V Y=2)V (y = 22) V (y = 32) V 
(y = 4a) V (y = 52) V (y = 6x) V (y = 7x) V (y = 8x) V (y = 92). 

(k) We construct one of the functions by picking out one of the images of 
each object in the domain, a second by picking out another, etc. In this way 
we exhaust all images if the number is finite. If not, practical difficulties may 
arise, but we can still imagine the process as accomplished. For example, we 
can imagine x > y as the conjunction of all sentences of the form + = y+k 
where k takes all real positive values. (1) solving; simultaneous. | 


ANSWERS TO PROBLEMS 


1. Pa. 2. Spouse. 3. Not brother. 4. <. 5. Empty. 7. Sb. 9. Ch. 11 —. 
13. Upper and lower halves of circle. 15. y = —2 & V25 — (x — 3)?. 
Wy = tVex. 19 y = ναῦν ψ = —V 23. 21. (2-8-5), 
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5-7 Linear functions. A function that has a defining formula of the 
form mz + ὃ is called a linear function. We studied the geometric proper- 
ties of the graphs of such functions in Section 4-5. We adopt mI + bas 
a name for the function defined by y = mz -+- ὃ; that is, 


(1) Def. mIl+b= {(az,y)|y= me+bAceE Re}. 


(a) Describe 27 + 3. 


A special class of linear functions consists of those for which m = 0. 
We define 


(2) Def. b= OJ + ὃ. 


The range of ὃ consists of the singleton {δ}, since the image of any x is 
just b. We call any function whose range is a singleton a constant function. 
The name suggests the idea that the image is the same for all objects in 
the domain. 


(b) Describe 3. 


Another special class consists of those linear functions for which b = 0. 
We define 


(3) Def. mI = mI + 0. 


We call these functions direct variations. Their graphs consist of straight 
lines through the origin. Among the direction variations is the negative 
defined by 


(4) Def. ἘΠ; 


(c) Describe the graphs οἱ —J, 21, --8]. 


Note that “J” is not a variable. On the contrary, 
(5) Def. «T= {(@yly= x Axe Re}. 


It is a constant standing for the set of all ordered pairs of real numbers 
whose components are identical. We call 1 the tdentity function. The 
expressions of the form mJ + ὃ used to designate linear functions have 
the advantage that a defining formula is obtained by substituting an 
independent variable for the constant J. We shall use this name of the 
identity function extensively for naming other functions. 

A function whose graph consists of a straight line or of the union of por- 
tions of straight lines is called piecewise linear. Piecewise linear functions 
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appear very frequently in science. As indicated in the problems following 

‘Section 4-5, a defining sentence of a straight line segment consists of the 
conjunction of a linear equation and an inequality requiring the inde- 
pendent variable to lie in an interval. Hence a defining sentence of a 
piecewise linear function consists of the disjunction of such sentences. 


Graph the functions defined by the following: (d) (y=x AOS 25 2)V 
y=8A2<2<3)V Y= —z+6A z= 3); (0) f(x) =1lfor0< 2 < 2, 
f(z) = 2for2 < x « 3,f(z) = lfor4d < x < 5,andf(z) = 3for6 < z < 8. 
(Ὁ Write Exercise (d) in the form of (6). (9) Write Exercise (0) in the form 
of (d). 


Y 


100 


0 12 3 4 5 6 


FIGURE 5-19 FiGurE 5-20 


Figure 5-19 is similar to several appearing in “The Theory of the 
Neural Quantum in Discrimination of Loudness and Pitch,” by S. 8. 
Stevens, G. T. Morgan, and J. Volkmann in the American Journal of 
Psychology, July, 1941. The variable x represents the percentage of cases 
in which a person discriminates this pitch as different from a comparison 
pitch corresponding to x = 0. Estimating the position of the corners, we 
have the following defining sentence of this functional relation: 


(4<25Ay=0) V (2.5 <2 < 45 A y = 50v —-125) 
V (# > 4.5 A y = 100). 


(h) Explain the equation y = 502 — 125. (i) Find the defining sentence 
if the corner points are (2.1,0) and (4.5,100). Graph. From the graph and 
Fig. 5-19, discuss the reactions of the two individuals. 


The norm. The norm function is defined by y = [1]. (See Section 3-9.) 
Its graph is suggested in Fig. 5-20. We designate it by |/| so that 
[Z|(x) = |e. | | | 


(6) Def. IZ] = {@, y) ly = [5}}.ἔ 
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(j) Write a defining equation of the norm by completing the following: 
Fe fe a Se Ky, (k) Graph y = {2z]. ° 
(1) Show that [Z(z)| = [1]. 


The signum function. The signum function, sg, is defined by 
Y=-1LAt<0DVYH=DOALZH=NVY=1AZx> dD). 


It is graphed in Fig. 5-21. 


(m) Find sg (0), sg (1), sg (—1), sg (10), sg (—3). (n) Argue that 
a-sg (x) = [1]. 


Step functions. Step functions are those whose graphs are made up of 
horizontal line segments and, possibly, single isolated points. The signum 
function is an example, since its graph consists of two infinite horizontal 
segments and the origin. Step functions appear very frequently. For 
example the postage charged on books is related to their weight by the 
step function graphed partially in Fig. 5-22. Here y is the postage in 
. cents, and z is the weight in pounds. Note that the right-hand endpoints 
are not included in each step. 


FIGURE 5-21 FIGguRE 5-22 


(0) What is the domain of the above function? Its range? (Ὁ) Graph the 
postage required on a first-class letter as a function of its weight. 


A convenient function for dealing with step functions is the untt step 
function [I] defined by y = [zx], where [5] = the largest integer less than 
or equal to x. Formally, 


(7) Def. [ze] = max [22 εὖ Az < 2}. 


The unit step function is graphed in Fig. 5-23. Note that each segment is 
open on the right and closed on the left. 
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FIGURE 5-23 


(q) Find [1.5], [1.7], [2], [0], [—1.6], [—3]. (ὦ Graph y = [22]. (8) Graph 
y = 2[1]. (Ὁ Argue that Fig. 5-22 is the graph of y = 9-+ ὅ[.. (ἃ) How 
can one tell whether “[z]” means the greatest integer less than or equal to x 
or whether the brackets are used to indicate grouping? 


PROBLEMS 

1Mff = mi+),f(x) =? 

2. What is the image of 3 under 81 — 2? 

3. Distinguish between 5 and 5. 4. Describe the graphs of 0, 1, —1. 

5. Show that f defined by “f(x) = the place to which road z led in the days 
of the Roman empire” is a constant function. What is its domain? its range? 

6. What function is both a direct variation and a constant function? 

7. Show that if two ordered pairs (+ (0, 0)) belong to the same direct varia- 
tion, then their components are proportional; that is, (71, y1) Ε mI A (2x2, y2) € 
mI - γι χι = ye/x2. | 

8. Show that the conclusion of Problem 7 is equivalent to yi/y2 = 21/2. 

9. When in scientific discourse it is said that “y varies directly with x” or 
that “y is proportional to 2,” the meaning is that there exists a number m such 
that (x, y) © mI whenever x and y are corresponding values of the variables, 
Le., that y = mz defines the relation between the quantities involved, and we 
have a direct variation. For example, for constant speed, distance is propor- 
tional to time. Letting y = distance and x = time, y = mx for some m. 
Here m = the speed. If we know one pair of values, we can calculate m and 
so determine the function. We call m the constant of proportionality. Suppose 
that we know that when the time is 6 sec the distance is 25 ft. Find the func- 
tion and determine the distance when time is 10 sec. 

10. The revenue from a sale is proportional to the number of units sold. What 
do we call the constant of proportionality? Find it if 50 units yield a revenue 
of $225. What revenue would be yielded by 28 units? | 
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11. The simple interest on a fixed deposit is proportional to the time. 
(a) What is the constant of proportionality called? If interest on $150 for 2 
years is $3, (Ὁ) find the constant, and (c) determine the interest for 18 months. 

12. If response varies directly as the stimulus and response is 25 when stimulus 
is 4, find the direct variation relating stimulus and response and determine the 
response when the stimulus is 8. 

13. (a) If y varies directly with x, does it follow that y increases when x 
increases? (b) By how much does y change if x increases one unit and x and y 
are related by mI? 

14. If y varies directly with 2, does x vary directly with y? 

15. The all-or-none law of response of a neuron to a stimulus is described as 
follows by Boring in Foundations of Psychology. “The magnitude of the activity 
in any single neural functional unit is as great as it can be in that unit at that 
time and is independent of the magnitude of the energy exciting it, provided 
only that the stimulating energy is sufficiently strong to excite the neuron at 
all.” Letting y = the activity of the neuron, x = the energy exciting it, 10 = 
the minimum energy required for a response, and 20 = the maximum activity, 
graph the function, and write a defining sentence. 

*16. Define sg (x) in terms of [1]. 


Graph 17 through 19. 
w17. y = [77,2]. 18. y = [22 — 1. 19. y = |a| + x. 


20. A car travels for 1 hr at 20 mi/hr, stops for 1/2 hr, then travels for 2 hr 
at 40 mi/hr. Letting s = the road distance from the starting point, andt = the 
time elapsed, graph the function relating ¢ and 8, considering ¢ as the inde- 
pendent variable. Is ¢ a function of s? 


Graph 21 through 23. 
Ἃ21. y = [x]. 22. y = [12]. 23. y = αὶ — [2]. 


24. According to The Biology of Population Growth, by R. Pearl, the density 
d of a fly population is defined by d = N/v, where N is the number of flies and 
v is the volume of effective free space. Experiments indicate that the density 
of the maximum population varies directly as the volume. Show that the 
size of the maximum population varies directly as the square of the volume 
of the space available. | 
*25. “For constant values of superthreshold reaction potential set up by 
massed practice, the number of unreinforced responses (7) producible by massed 
extinction procedure is a linearly decreasing function of the magnitude of the 
work (W) involved in operating the manipulanda, 1.6., n = 3.25(1.1476 — 
0.00984W).” (A Behavior System, by C. L. Hull) Graph the function. How 
is the “decreasing” character reflected in the formula and graph? 
* 26. A Finnish geographer who studied the pattern of geographical distribu- 
tion of industries around the town of Turku in the 18th century found that the 
value of loads of lumber, rye, and flax varied as the square of the distance they 
were hauled to market. In particular he found that a load of lumber hauled a 
distance of 50 km had a value of 7 and a load of rye hauled a distance of 
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(6,3) (8, 3) 
ὡ------.......Ὁ0 


(2,1) (4,1) 6,1) 


0, 0) 


FIGURE 5-24 


(1, 2) 


FIGURE 5-26 Figure 5-27 


140 km had a value of 54. Find the constant of proportionality and verify his 
claim. If the value of a load of flax was 135, how far was it hauled? 

%* 27. If V is the volume of a gas at constant pressure and ¢ is the corresponding 
temperature, then V = Vo(1 + δ(ἐ — to)). Show that this defines a linear 
function. Describe Vo. Describe ὃ. Sketch. 


ANSWERS TO EXERCISES 


(a) Set of ordered pairs whose second components are three more than twice 
their first; graph is a straight line with slope 2 and intercept (0,3). (Ὁ) Set 
of pairs all of whose second components are 8. (6) Lines through origin with 
slopes of —1, 2, —8. (d) See Fig. 5-24. (e) See Fig. 5-25. (ἢ f(x) = x 
for 0 “ ἃ “ 2, f(z) = 3 for 2 < x < 3, and f(x) = —z+ 6 for x > 38. 
(g) Y= LADS #e¢<2VY=2ZA2Z2<24<3)VYrz1A4N2<5)V 
(y¥=3A6S 2 8). 

(h) (4-5-4) with the points (2.5,0) and (4.5, 100). G) y = 425} — 88. 
4) > 0;2 «0. (k) See Fig. 5-26. (1) From (5) J(z) = x. (m) 0, 1, 
—1,1,—1. (nm) Cases. (0) Positive reals. {9,14,19,....} (Ὁ) Like Fig. 
5-22 but with first step at y = 4 and risers of height 4. (4) 1, 1, 2,0, —2, —3. 
(r) See Fig. 5-27. (s) Same. (Ὁ) Check cases. (u) Context. 
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(0, 0) 
Figure 5-28 FiGcuRE 5-29 


FIGuRE 5-380 Figure 5-31 


‘Figure 5-32 


ANSWERS TO PROBLEMS 


1. mxz-+ ὃ. 3. A real number; the set of all ordered pairs whose first com- 
ponent is a real number and whose second component is 5. 5. Since all roads 
led to Rome, the image of x is always Rome. Domain is set of all roads at that 
time, range is the singleton {Rome}. 7. γι = maz A ye = m2 - y1/%1 = 
m A γα χα = m. 9. 25 = 6m. Hence m = 25/6 and the function is (25/6) f. 
When zx = 10, y = (25/6)(10) = 250/6 = 125/38. 11. (a) The interest per 
period, (b) $1.50, (c) $2.25. 18. (a) No, only if m > 0, (b) m units. 15. See 
Fig. 5-28. 17. See Fig. 5-29. 19. See Fig. 5-30. 21. See Fig. 5-31. 22. Not 
the same as Problem 21! 23. See Fig. 5-32. 25. Negative slope. 27. Vo is the 
image of tg; ὃ is the change in volume per unit of original volume per unit 
change in temperature. 
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5-8 Quadratic functions. A function with defining formula of the form 
az” + bx + ¢ with a σέ 0 is called a quadratic function. We define 


(1) Def. al? +b2 +c= {(z,y)|y = ax? + br +e A ve Re}. 


Consider, for example, the function 277 — 31] — 4 defined by y = 
2χ" — 3x — 4. By making a table for a few integral values of x and 
drawing a smooth curve through the corresponding points, we find the 
graph of Fig. 5-33. The graph appears to extend indefinitely upward but 
to have a lowest point. By completing the square, as in Section 4-8, 
we find 


(2) | 2." — 32 — 4 = 2(x — 3/4)? — 41/8. 

Since the first term on the right is never negative and is zero for x = 3/4, 
we see that the smallest value of y is —41/8 and that the lowest point on 
the graph is (3/4, —41/8). This means that the range of 2x” — 3x — 4 


is the interval (—41/8 οὐ). 
More generally, 


(3) ax? + ber +c = a(x + b/2a)? + (4ac — b?)/4a. 
If a > 0, the first term is never negative and is zero for x = —b/2a. 


υ 


Figure 5-33 
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Hence the minimum is (4ac — b?)/4a. On the other hand, if a < 0, the 
first term is never positive and we have a maximum at the point | 


(—b/2a, (4ac — b*)/4a). 
(a) Sketch —27? — 31 -Ἐ 4 by plotting selected points. Find the highest 
point on the graph by completing the square. Find the range of the function. 
We can summarize the above results as follows: 


(4) f=al?+blt+eAa>O0 
— min Rge (ἢ) = f(—b/2a) = (4ac — b?)/4a, 


(55) f=al*+bl+canAa<oO 
— max Rge (f) = f(—b/2a) = (4ac — b*)/4a. 


We call the high or low point the vertex of the curve. Evidently in every 
case the vertex is the point corresponding to 7 = —b/2a, a result that is 
easy to remember. 


(b) What is the vertex of the graph in Exercise (a)? 


Let us imagine a company producing and selling x units of output at 
a cost C given by C = x” + 100x + 100. Suppose that sales 2 and price 


p are related by « = —p + 200, that is, p = 200 — ὦ, so that revenue 
R from sales is given by R = px = 200x — x”. Then profit y is given 
by y= R—C = —2x?+ 100; — 100. This equation defines the 


function —2J* + 1007 — 100. For what output does the company 
maximize its profit y? Using (5)(a:—2, b: 100, c:—100), we find that for 
an output of 25 the company has a maximum profit. 


(c) Plot the profit function. (4) The path of a projectile fired from the 
origin in the zy-plane is given by y = 2x — x”. How high does it rise and 
where does it again reach the level from which it was fired? 


Looking again at Fig. 5-33, we observe that we have not yet found the 
points where the graph crosses the x-axis. These, of course, are the points 
where y = 0, that is, where ax? + ba + ὁ = 0. They are easily found by 
(2-8-21) to be —b/2a — +/b? — 4ac/2a and —b/2a -+ ~/b2 — 4ac/2a. 


(6) Find the coordinates of these points in Fig. 5-33. 


The above results show that the two xz-intercepts of a quadratic function 
lie at equal distances to the right and left of the line x = —b/2a. The ap- 
pearance of Fig. 5-33 suggests that if we draw any horizontal line meeting 
the curve, its points of intersection will be equidistant from the line 
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υ 


x = —b/2a 


i 
| 
| 
| 
| 
| 
| 
-(b2-4ac)""| (6? -4a0)"7 

| 

| 


: (= 4ac — s 
Qa’ Aa 


FIGURE 5-34 FiGcuRE 5-35 


ῳ = —b/2a. To prove this let f(x) = a? + bx + ¢. Then f(—b/2a + h) 
a(—b/2a +h)? + b(—b/2a +h) + ¢ = a(b?/4a? — bh/a + h?) 
b(—b/2a + h) - ¢ = b?/4a — bh + ah? — b7/2a + bh + ἐς = ah? 
b?/4a +c. Similarly, f(—b/2a + h) = ah? — b?/4a + ς. 

This means that if we move ἢ units to the right from the vertex we get 
the same y-coordinate for a point on the curve as if we moved h units to 
the left. The situation is sketched in Fig. 5-34. We describe this property 
by saying that the curve is symmetric about the line x = —b/2a, and we 
call the line ᾧ = —b/2a the axis of symmetry. (Symmetry is discussed 
systematically in Sections 5-14 and 5-15.) 


(f) Sketch 12 — 21 - 2 by finding the vertex, axis of symmetry, and a 
few other points. 


| + I 


From the discussion in Section 2-8 it is evident that the graph of the 
quadratic function may fail to meet the z-axis at all. Indeed, it will meet 
the z-axis if and only if the discriminant δ᾽ — 4ac is non-negative. If 
a > 0, we have the three cases sketched in Fig. 5-35. When the discrimi- 
nant b? — 4ac is positive, we have two points of intersection; when it is 
zero, the graph is tangent to the x-axis; when it is negative, the graph lies 
entirely above the z-axis. 


(g) Discuss the case a < 0. (h) Obtain results found in the above para- 
graph and in Exercise (g) from (4) and (5) without using results concerning the 
roots of quadratic equations. 


In Section 3-5 we gave some methods of solving quadratic inequalities. 
The results of the present section make it easy to solve any inequality of 
the form az? + br Ἐς < (>, <, >) 0. We simply sketch the function 
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al? + Ὁ] + ς and observe where it is above or below the x-axis. The 
solution is either an interval between the roots of az? + br + ¢ = 0 or 
the complement of this interval. If there are no real roots, the inequality 
is satisfied by all real numbers or none. 


Verify the previous comments by solving the following inequalities: 
(i) 227 +- 82 — 1 «0, (j) 2.2 - 327 — 1 > 0, (k) x? — 41. -᾿ δ > 0, 
() 955 Ξε 135 


Since we are often concerned with the maximum and minimum values 
of the dependent variable and with the values of the independent variable 
for which the dependent variable is zero, it is convenient to have some 
special terminology. We call max Rge (f) the maximum of the function f 
and min Rge (f) the minimum of the function f. If (29) = max Rge (J), 
we say that f takes its maximum at xp. Similarly, if {(40) = min ἤρα (f), 
we say that f takes its minimum at 2. If f(xo) = 0, we call xo a zero of 
the function. A zero of a function f is thus simply a root of the equation 


f(x) = 0. 


(m) Give an example of a function that has neither a maximum nor a mini- 
mum. (n) Give an example of one that has a maximum but no minimum. 
(o) Assuming that a, ὃ, and c are real, write a necessary and sufficient condition 
that the zeros of aJ* + δ1 -+-c are imaginary, (p) equal, (q) unequal. 


PROBLEMS 


Graph the functions in Problems 1 through 6. Show the axis of symmetry, 
vertex, and intercepts. Find the zeros, maxima, and minima. For each one 
find the subset of the domain for which the images are positive and the subset 
for which they are negative. 


1.72 —1. 2, [2+ 3. ey Coen τοῦ 
4, 972+ 61 +1. δ. 912+ 8] — 1. 6. —312 + 21+ 2. 


7. Solve for m: 10m? — 1386m/N/R-+ N/R? = 0. 
8. Solve for a: a(a — 1)+ poat 40 = 0. 
*9. Find the quadratic function whose zeros are the squares of the zeros of 
al?+ bI +c. | 
*10. The following formula appears in Fertility and Reproduction, by R. R. 
Kuezynski: 


T = (1/2)[(Ri/Ro + V (Ri/ Ro)? — 2(R2/ Ro — (Κι Ro)2) log. Ro). 


Do you think that this formula was derived by solving a quadratic equation? 
Why or why not? If so, what is the other root? Write an equation of which 
this 1s a root. 

11. Solve for r and for p: (r — p)?. = p(1 — p)/bd. 

12. For what values of & will the graph of the function kJ? ++ 3kI — 4 lie 
entirely below the x-axis? 
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*13. A company has a cost function given by C = Azv?4+ Bx-+C and a 
demand function given by x = ap-t+ ὃ. Find the output and price for maximum 
profit. Argue that A > 0 and a < 0, and show that the result must actually 
be a maximum and not a minimum. | 

14. When a projectile is launched with muzzle velocity of vo in a direction 
making an angle a with the horizontal, its path is given by x = (vo cos a), 
y = (vosina)t — (1/2)gt?, where ¢ is the time. What is the highest point on 
its path? Where does it strike the ground? Show that its path is the graph of 
a quadratic function and find this function. For what angle ἃ will it go the 
farthest? 


Graph 15 and 16. 


WIS. y= “2 - ἘΣ ΌΞΕΙ ΔΛ χες(- 2 3). 
W16. y > 22° — Bt ~-1L Ay 2A —-1<2< 1. 


ANSWERS TO EXERCISES 


(a) Rangeis(— © 41/8). (Ὁ) (—3/4, 41/8). (4) To (J, 1) and at (2, 0). 
(6) (—0.85, 0), (2.8,0). (Ὁ Vertex is (1,1), axis of symmetry is the line 
x = 1. (g) Same, except that all curves open downward and curve is entirely 
below x-axis if discriminant is negative. (h) The sign of the minimum or maxi- 
mum of the range determines whether the curve crosses the axis; it does so if 
this sign is opposite to that of a. (1) Openinterval. (1) Union of two infinite 
intervals. (k) (--ῷὸ οὐ). () Null set. (m) Any nonconstant linear 
function. (n) Quadratic witha < 0. (ο) Discriminant negative,  (p) zero; 
(q) nonzero. | 


ANSWERS TO PROBLEMS 


1. Positive on (—*©_0) U(1_&). 7. (68 4 V4614)VN/10R. 9. Check 
by substitution. 13. x = (6+ aB)/201 — aA) (see Mathematical Introduction 
to Economics, by G. C. Evans, for more problems of this kind). 


5-9 Composition. Suppose a ship P is sailing due east on a line 50 mi 
north of an observation point O. We are interested in the distance OP in 
Fig. 5-36, where x is the distance to the ship from the point B on its 
course nearest to O. Clearly y = »/502 + x2. If the ship is maintaining 


Figure 5-36 
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a speed of 20 mi/hr, and if we measure time ¢ beginning at the point B, 
we have x = 20t. Then y = ν 802 + (20t)?. Here we have three func- 
tions, one relating ¢ and 2, one relating xz and y, and one relating ¢ and y. 
We say that the third is the composite of the first two. To find the image 
of ἐ under the composite, we find the image zx of ¢ under the first function 
and then the image y of x under the second function; that is, we go from 
ἐ ἴο x and then from z to y. 

Again, suppose that y is the father of z and z is the father of x. We have 
z= Fa (a) and y = Fa(z). Then y = Fa[Fa (x)]. We call the function 
defined by this last equation the composite of Fa and Fa. Suppose that 
zis a parent of x and that y is a parent of z; that is, x Pa z and καὶ Pa y. 
Then y is a grandparent of x, or x Gp y. We call Gp the composite of Pa 
and Pa. Now suppose we are given x and y and wish to show that x Gp y. 
We should have to show that there was some individual who was a parent 
of x and a child of y; that is, Jez Paz A z Pa y. 

We use this idea to define the composite p - σ of the relations p and o 
as follows: | 


(1) Def. pea= {(z,y)|JzexrpzAec y}, 
(2) e(peagvymdizrpzAzcy, 
(3) ( ρε: λ :σὺ) -- α (ρ ο σὴ ν. 


From (3) we see that to find the images of x under p © a, we find the 
images of x under p and then the images of these images under ¢. For 
example, to find the grandparents of x we find the parents of z and then 
the parents of these. When the relations are functions, it is easy to find 
a defining formula for the composite from those of the two functions. If 
F and G are functions, (2) becomes y = (F ° G)(x) «» Ξ2 5 = (2) NyY= 
G(z) «νὰν = G[F(2z)|. The last equation says simply that the image 
under (Γ΄ ὁ G) is found by finding the image under F, namely F(x), and 
then the image of this under G, namely G[F(x)]. This was exactly the pro- 
cedure we followed in the example at the beginning of the section, where 
we have F(t) = 20t, G(x) = W502 + 22, and G[F()] = V502 + (202)2. 
We have, then, 


(4) (F > G(x) = G[F(2)]. 
Note the reversal of order. If we think in terms of operations, Ff 9 G 
means the operation F followed by the operation G, and so does G[F(zx)]. 


It seems natural to use G[F] to stand for the composite of F and G, when 
F and G are functions. Hence we adopt the notation, 


(5) Def. G[F] = F ¢ G. 
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The notation on the left could be used also for nonfunctional relations, 
but its convenience comes from its similarity to G[F(x)]. With this nota- 
tion, (4) becomes 


(4) (Ο1}])(4) = GIF@)]. 


(a) Suppose y = z* and z = 22. What is the relation between x and y? 
Find the defining formula of: (b) (22) ο (3, (ce) (12) > (27+ 1), 
(d) (22+ 1).12. (e) What is the meaning of y = Mo (Fa (z))? 
(Ὁ If F(x) = 1 — xand G(z) = 32, find G[F(zx)] and F[G(z)]. 


We see from Exercise (f) that composition of relations is not commuta- 
tive. However, a little reflection shows that it is associative; that is, 


(6) β»5 (σ ο Τὴ = (ρ ο σὴ oF. 


Indeed, the images of x under the left side are found by determining the 
images under p, then the images of these under σ᾽ ὁ 7, that is, the images 
of these under o and then the images of these under 7. But the right side 
calls for the same operations. 

Since composition is associative, it 1s satin to ask whether we ‘can 
derive other results analogous to the laws of the algebra of numbers. 
For example, is there a relation that plays a role like that of the number 1 
under multiplication and has the property similar to that given in the 
law l1-a =a-:1= a? We define 


(7) Def. I4 = {(2,y)|e = y} N (A X A). 


We call 74. the zdentity relation on A. It is evidently the set of all ordered 


pairs with identical members belonging to A. Then, 
» 


(8) pSCAXA——>(l4e-p=p-l, = p), 
(9) pL&AXB—(I4° p= p), 
(10) pCAXB— (po [5 = p). 


Let. p and o be defined by Fig. 5-8. Graph: (g) poo, (h) aep, (i) Ia 
where A = {0, 1, 2, 3,4, 5,6}. (7) Verify (8) for these relations. (Κ) What 
is the relation between 14 and I defined by (5-7-5)? 


The above results are fairly obvious, since 74 maps any 2 into itself. 
Hence [4 ο p maps z into z and then into images of x under p, and p ° I, 
maps each Ζ into its images under p and then these into themselves. 
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When p is a function, (8) becomes 
(11) FCGCAXA— (F{l4] = IalF] = FP). 


PROBLEMS 


In Problems 1 through 12, defining formulas for 17 and G are given in that 
order. Find a defining formula for ΟἿ}. | 


ie 2e, Ge. 2. 2x, 32 +. 1. 3. χ2, 2 -Ἐ 1. 

4. 1. -Ἐ Ὶ, 2%. 5. 175, 1/2. 6.3+24,3 -- ζΖ. 
7. χ", 2". 8. 2, 52. 9. 52, 2. 
10. —z, —2z. 11. sin x, 27. 12. 32, cos x. 


Determine 13 through 18 by finding defining formulas. 


13. (21) e (5D). 14. (J2 + 21)» (31). 
15. 41 o sin. 16. 50 12, 
17. 3: 04 7 1). 18. (mI +b) (mI +B). 


%* 19. Suppose that a point moves so that the vector to it from a fixed point 
moves through an angle of ὦ units per unit time. Then we call ὦ (omega) the 
angular velocity about the fixed point. If the point moves in a circle of radius r 
about the fixed point, it covers an are of rw per unit time. We call rw its linear 
velocity. Suppose the angular velocity varies with time, so that ὦ = 3f. Find 
the function that maps the time into the linear velocity if the radius is 25 ft. 
* 20. The radius of a circle is increasing at the rate of 100 yd/min. Find the 
function that gives the area of the circle in terms of time ¢ if the radius is zero 
when ἐ = 0. 

*% 21. Show that the composite of two linear functions is linear. 

*% 22. Show that composition of linear functions is not commutative. 

#23. Show that the composite of a linear and quadratic function in either order 
is a quadratic function. 


Rge GCF) 
G(F(x)) 


FIGuRE 5-37 
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% 24. With the aid of Fig. 5-87, justify 


(12) Dom (F' > G) € Dom (F), 
(13) | Rge (Fo G) C Rge (G). 


% 25. Draw a figure similar to Fig. 5-13 to suggest the idea of composition. 
* 26. Find the domain and the range of Hu, Mo, and Hue Mo. Verify (12) 
and (13). Note how Rge (Hu) and Dom (Mo) overlap. 

*% 27. Of what set is Rge (F  G) the map? 

* 28. Repeat Problem 26 for Moo Hu. 

*29, Do the same for F = {(2,y)Jy = 27 Axed}, G= {(z,y)ly = 
2x A x is odd}. 

* 30. Translate into words: x(Pao Mo)y; x (Moo Pa) y; x(Soe Da) ο Soy; 
z (Wic Hu) > Mo y. 

*#31. Identify: >> >; =o =; 4o #, 

* 32. Is it possible for the composite of two nonfunctional relations to be a 
function? 


ANSWERS TO EXERCISES 
(a) It is defined by y = 4x7; that is, it is 477. (Ὁ) 61. (ce) 272+ 1. 
(d) 477+ 27+ 1. (ὁ) y is the mother of the father of x. (ὃ 3(1 — x)? 


and 1 — 812, (g) through (i) See Fig. 5-38. (k) I = Ip., where Re = the 
real numbers. 


υ 


Do © Be σι oq 
no Oo Fe wo mS 


Figure 5-38 


ANSWERS TO PROBLEMS 


1, 10x. 8. 27 +1. 5. 2. 7. vt. 9. 2. 11. Qsinz. 13. 107. 15. y = sin 4. 
17. —2l — 2. 18. m’mI-+ m’b+ δ΄. 19. Letting y = the linear velocity, 
y = 25w andw = 3t; hence y = 75¢ and the function is 757. 21. See Problem 
18. 23. (mI - δ) ° (AI? -+ BI + C) = Am?I? + (2Amb-+ mB)I + Ab? + 
Bb+C; (Al? 4+ BI + C)- (mI +b) = mAI2+ mBI + mC +b. 25. Two 
machines, one feeding into the other. 27. Rge (F) ὦ Dom (4). 
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5-10 Converse relations. To introduce the concept of the converse of 
a relation, we ask: if x is greater than y, so that (xz, y) € >, what is the 
relation that y bears to x? Obviously y is less than z, so that (y, x) € <. 
We call each of these relations the converse of the other. The converse p* 
of a relation p is the relation such that y ρὲ x if and only if z p y; that is, 


(1) Def. p* = {(z,y)|(y, x) € p}, 
(2) [(b, a) Ε p*] «» [(a, Ὁ) Ε ol, 
(3) (b p* a) «» (ap 5). 


These laws mean that the converse of a relation is obtained by inter- 
changing components in each pair belonging to the relation. 


(a) Graph ρὲ and o*, where p and σ are defined by Fig. 5-8. (Ὁ) Graph 
{(1, 1), (8, 1), (—2, —5), (1, —1)} and its converse on the same drawing, using 
circles for points belonging to the original relation and crosses for those be- 
longing to the converse. With a straight line connect each point with the corre- 
sponding point in the graph of the converse. 


As suggested by the solution to Exercise (b), the points (a, δ) and (6, a) 
are always placed so that the graph of J (the line defined by y = 2) is 
the perpendicular bisector of the segment Joining them. This can easily 
be proved by reference to Fig. 5-39. Indeed, the triangles OAB and 
OA’'B’ are congruent because they are right triangles whose sides have 
lengths ja| and |b]. Hence OB = OB’. Also, since Ζ 408 = ZA'OB’ 
and ZAOC = ΖΑΌΟ, ZBOC = ZB’OC. It then follows that B’C = 
BC and that I is perpendicular to BB’, as we wished to prove. 


(c) Justify the last statement. (d) Make a new drawing and check the 
proof for a point in the second quadrant. (6) What if (a, δ) lies in the line I? 


From the above it follows that the cartesian graph of p* 1s obtained by 
“reflecting” the graph of p in the line defined by y = x. We use the word 
“reflecting,” because an object reflected in a mirror appears to be located 
on the other side of the mirror a distance equal to its distance in front of 
the mirror, just as (b, a) is located in relation to the line y = x and the 
point (a, b). Another way of describing the relation is to say that the graph 
of p* may be obtained by rotating the graph of p up out of the plane around 
the line y = x through 180° 


Use the above idea to graph the converse of: (ἢ) 21, (g) I”. 
(4) {(x, y) | s(x, y)}* = E(w, ψ) [sy, 2)}. 


This law, which follows immediately from (2), indicates explicitly that 
a defining sentence of the converse of a relation is obtained by inter- 
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FIGURE 5-39 Fiaure 5-40 


changing the variables in a defining sentence of the original relation. For 
example, 21 is defined by y = 2x. Hence (2/)* is defined by x = 2y or 
y = x/2. Hence (2/)* = (1/2), which is confirmed by Exercise (f). 
Similarly, J? is defined by y = x”. Hence (J?)* is defined by y? = zx. 


(h) What is (27 + i1)*? Find a defining formula and graph 21 + 1 and its 
converse. 


It may be helpful to visualize a relation and its converse as mappings. 
A relation p maps each element in its domain into the corresponding 
images in its range. The converse p* maps each element in its domain 
(the range of p) into its images under p*. The images of x under p* are 
precisely the objects one of whose images under p is x. This is suggested 
in Fig. 5-40, where, for simplicity, we indicate only one image under both 
relation and converse. 


G7) Make a graph similar to Fig. 5-16 of the converse of the function graphed 
there. (1) Formulate a rule for making this kind of a representation of a 
converse. 


We see from Exercise (g) that the converse of a function may not be a 
function. However, if both F and F* are functions, each establishes a 
one-to-one correspondence. When F and F* are functions we call each 
the znverse of the other. The following hold for inverses. 


(5) ΡῈ = {(z,y)|2 = FY)}, 
(6) F*(x) = [ya x = Fiy)], 
(7) ly = F*(z)] o [t = F(y)], 
(8) F*(F(x)] = 2, 


(9) FIF*(y)] = ψ. 
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Theorem (5) is simply (4) applied to a relation with a defining sentence 
of the form y = F(x). The others follow from the discussion in Section 
o-4. In (9) we use y to remind the reader that the range of the variable is 
the domain of F*, 1.e., the range of F. 


(k) Is F*(F(x2)] = FLF*(2)] = ᾧ alaw? (1) Argue for (8) and (9) in terms 
of mappings. 


We now have procedures for investigating the converse of a known 
relation. To find the graph, we rotate about the line y = x. To find a 
defining sentence, we interchange variables. If the converse of a function 
defined by y = F(x) is also a function, we can find its defining formula 
by solving « = F(y) for y so as to get y = F*(x). For example, the in- 
verse of 2/ + 3, defined by y = 2x + 3, is obtained by solving x = 2y + 
3 for y. We find y = x/2 — 3/2, so that (21 + 3)* = 1/2 — 3/2. 


(m) Find (mi -+ δ). (ὦ) Is there any linear function whose converse is 
not a function? (0) Describe mJ -}+ δ and its inverse as operations. 
(0) Which linear functions are their own inverses? (q) Graph the relation 
defined by y = Vz, and graph its converse. 


PROBLEMS 


In Problems 1 through 12 graph the relation indicated by a defining sentence 
or name. Find a defining sentence for the converse, and a defining formula if 
the converse is a function. 


ly =—a2z+1. 2. —2I — 5. δι ψ = 1 — 2’. 

4. y = χ3. 5. y = 22? A «> 0. 6B y= e?Ax <0 
7. = 2, 8.y = --2. 9. 72 -+ y? = 4. 
10. y = 1/z. 11. 12 — 1. 12. J*-+- 2]. 

For Problems 13 through 16 verify (8) and (9). 

13. F = —I — 7. 14. F = 107+ 18. 


15. F = {(z,y)|y = 22 λα 1]. 16. F = {(, 1), (2,7), (8, --1}}. 
Identify the relations in 17 through 26. 


17. «-, 18. ΣΈ, 19. <*. 20. >* 21. 15. 
22. (—I)*. «23. Hux. 24, Wit. #25, Sb*. *26. Bre. 


27. Find a defining sentence for the converse of the function AJ? + BI + Ο. 
Express this converse as a union of two functions. 

28. Graph 2/2? + I — 1 and (21*-+ 1 — 1)* on the same drawing. 
*% 29. Defining zx Chy =gq y is a child of x, define Ch as the converse of an 
already defined relation. 


*30. Derive the following from the definition of the converse. 
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(10) [Dom (R*) = Rge (R)] A [Rge (R*) = Dom (£)j, 
(11) (R*)* = ἢ, 

(12) (RU S)* = R* U 85, 

(13) (RN S)* = R* ἢ S*, 

(14) πὲ = R*’, 

(15) [Ἐ = I, 


31. Argue for the following law in terms of images and mappings: 
(16) (poa)* = ρὲ oo, 


*% 32. Noting that (y is a parent of the wife of x) = (x 1s the husband of a 
child of y), illustrate (16) for various familial relations. 

33. Argue for (16) in terms of operations. 

34. Verify (16) for p = 32+ 2ando0 = —I- 5. 
%*%35. Prove (16). 
*%36. Show that pCa — p* Cor, 
%*%37. What happens if we take the composite of a relation and its converse? 
Show the following and illustrate them for familial relations. 


(17) a(p*¥op)y dzzpxrAzpy, 
(18) z(pop¥)ydzxp2zn y pz. 
* 38. Show that, if F and F* are functions, 

(19) F*[F] = Ipomiu); 

(20) PLP*) = ΤΠ προ: 


*% 39. Illustrate (19) and (20) with familial relations. 

* 40. In the sentence x € A, what is the subject? the verb? the object? When 
names of relations are used as verbs, what is the grammatical role of a name of 
the converse? the complement? Give the converse and complement of 
“dominates.” 


ANSWERS TO EXERCISES 


(a) Check by later discussion. (Ὁ) {(1,1), (1,3), (—5, —2), (~1, ID}. 
(c) OC is bisector of vertex angle of an isosceles triangle; hence it is also a 
median and an altitude. (6) Then (a,b) = (6, a), and the segment reduces 
to a point. (f) See Fig. 5-41. (g) See Fig. 5-42. (8) (1/2)J — 1/2. 
(i) Reverse arrows. (7) See Exercise (1). (k) Yes, but the range of zx is 
Dom (F) N Dom (F*). (1) F carries x into its image, and F* carries this 
image back into z. (m) (1/m)(I — δ). (n) The constant functions. 
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¥ 
12 


ὠ ἢ" 
a: 


Figure 5-41 FIGURE 5-42 


(o) mI + ὃ multiplies by m and adds ὃ; (mJ + 6)* subtracts ὃ and divides 
by m. (p) 1 and —I-+ ὃ. (q) The graph of the inverse is just the right 
half of the graph of 12. 


ANSWERS TO PROBLEMS 


1. y = —ax-+ 1 (own inverse); 3. y2 =1—2. By = Va Ty = Wo. 
9. x? -+ y? = 4 (own inverse). 11. y? = a2+1. 13. F* = —I — 7; 
F*(F(x)) = --Ἕ--ὦ -- 7) --͵7 Ξ- ν. 15. ἔξ = {@,yly = Ve λ τ δ 1); 


F*(F(z2)) = Va? = x; F(F*(x)) = (Vx)? = x. Note that without some 
limitation on 2, the inverse of the function defined by y = 2? is not a function. 
17. >. 19. >. 21. 71. 23. Wi. 25. Sb. 27. Ay*?+ By+C = z. 


_ \/B2 — 4A(C — 2x) 
(AP + BI +0)* = ω yly = se at oer 


—B+W/B? — 4A(C — Ἴ 
U [ω yy a 
29. Ch = Pa*. 33. We perform p then o in peo. To undo this, we start by 
undoing the last operation, i.e., by performing σὲ then p*. 35. y (pe a)* 7 
epcrcay Jzxrp2z2Kzay > Jz ptx A yo®t ze sy ot2z2 Λ z pk 4 
y o* o p¥ χ. 

37. y(Pa* o Pa)x — y and x are parents of the same child. 39. 2(Hu > Hu*)y = 
z(Hu - Wi)y — (y = 2x) in our society. 40. x is the subject, € is the verb, 
A is the object. The converse is represented by the passive if the original verb 
was in active form, or the active if the original was in passive form. The comple- 
ment is represented by a negated verb. (dominates)* = is dominated by; 
dominates’ = does not dominate. 
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5-11 Circular functions. Figure 5-43 shows the monthly load y of 
active cases of family welfare agencies in a certain city in relation to time 
x during the years 1926 to 1929, where 100 = the average load in 1927. 
The function relating y and x has the property of practically repeating 
itself in successive intervals. A function f is said to be periodic if there 
exists a positive number 7 such that f(z + T) = f(x) for all x in Dom (f). 
Any such number is called a period, and the smallest period is called 
the period. 


Average 
for 1927 


1926 1927 1928 1929 


Fig. 5-48. Adapted from Statistics in Social Studies, by ὃ. A. Rice, 1930, p. 52. 


(a) State the definition of “periodic,” using quantifiers. (b) Cite examples 
of periodic functions. (c) Show that if 7 is a period, so is nT for n Ε Jt. 
(d) Why “positive number” in the definition? (6) What approximately is 
the period of the function of Fig. 5-43? 


Because of the cyclic, repetitive character of many natural and social 
phenomena (orbits of planets and atomic particles, motions in machines, 
biological cycles, waves, economic fluctuations, etc.), periodic functions 
are very common. Perhaps the most useful periodic functions, and the 
only ones we consider here, are the circular functions sin, cos, tan, cot, 
sec, and csc. For example, sin is defined by y = sin a, and is periodic ac- 
cording to (4-12-19). Here we use x instead of 6 as the independent vari- 
able. The geometric picture of the relation between x and sin x is sug- 
gested in Fig. 5-44, where x is the directed arc length from (1, 0) measured 
along the unit circle. The graph of sin in Fig. 5-45 may be obtained by 


Figure 5—44 
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y 
(1/2, 1) 


ΖΏΩΝ (π, 0) (ὥτ, 0) 
ὌΝ 2 


π΄, 0) (0, 0) 


( 


(ΞΞ 22, -- 1) 


Figure 5-45 


measuring directed ares x along the horizontal axis and erecting verticals 
of directed length sin x measured on Fig. 5-44, that is, by “unwrapping” 
Fig. 5-44. 


(Ὁ) What are the domain and range of sin? Whatis the period? (g) Review 
Sections 4-11, 4-12, and 4-13. Why do you think we use the term “circular” 
here? (8) From (4-12-9) prove Vz cos x = sin (x + 7/2). 

Because of the identity proved in Exercise (h), the graph of cos may be ob- 
tained by shifting the graph of sin a distance of 7/2 toward the left. (4) Ex- 
plain this. (j) Make a large graph of cos. What is the period of cos? 


A comparison of Figures 5-43 and 5-45 suggests that the latter could 
be made to “fit” the former by only moderate distortion. Consider 
A sin{mI -+ ὃ] defined by 


(1) y = Asin(mz + δ). 


Its domain is the real numbers, as for the sin, but its range is (—|A|__|A)). 
We call | A| the amplitude of the function. Since sin(m(a + 2r/m) + ὃ) = 
sin (mz + ὃ -+ 27) = sin (mx + ὃ), a period is 27/m. Since this is clearly 
the smallest positive number with this property, 27/m is the period. 
If we think of x as the time, the period is the time required to repeat. 
If we take its reciprocal (divide one unit of time by the time per repetition), 
we get the frequency f defined by f = 1/7. For (1) the frequency is m/2r. 

To see the significance of ὃ, let us compare (1) with the function defined 
by y = Asin ma, where ὃ = 0. Noting that 


sin (mx + b) = sin m(x + b/m), 
we see that sin[mJ] and sin [mJ + ὃ] have the same range and domain, 


but that the graph of the latter may be obtained by shifting that of the 
former a distance b/m to the left. The number ὦ is called the phase. 
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FicgurRe 5-46 


Graph each of the following, showing a range including several periods: 
(k) 2sin, (1) sin[/2Z], (m) sin[Z/2], (n) βιη [1 4 7/3]. 


Obviously, the converse of a periodic function is not a function. In 
particular, x = sin y has infinitely many solutions for y corresponding to 
each value of x so that sin* is not a function. 


(o) Sketch sin*. 
However, by limiting the domain of sin, its converse becomes a function. 
(2) Def. Sin = {(z,y)|y = sing A —7/2 < αὶ < 7/2}. 


Then Sin* is a function, as suggested in Fig. 5-46. We call Sin* the 
Arcsin, and read “Arcsin v” as “the arc (or angle) whose sine is 2x.” 


(3) Def. Arcsin = Sin*. 

Similarly, 

(4) Def. Cos = {(z,y)|y = cost AO «ὦ <r}. 
(5) Def. Arccos = Cos*. 


(p) Sketch Sin. What are its domain and range? (q) Sketch Cos. (r) Sketch 
Arccos. What are its domain and range? 


Our choice of domain for Sin and Cos was dictated by a desire to have 
functions whose graphs are unbroken, have 0 in the domain, and contain 
points in the first quadrant. 

(s) Suggest an alternative for Sin that would violate only the first require- 
ment. (t) Suggest one that would violate only the second. (u) Suggest 
an alternative for Cos that would violate only the third requirement. (v) Show 
that the criteria completely determine the choice. 
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PROBLEMS 
1. Prove that 27 is the period of sin. 


Graph each function in Problems 2 through 10, indicating the amplitude 
and period of each. 


2. sin [31]. 3. cos [21]. 

4, —sin [1]. | 5. sin [1 + 2/6]. 

6. sin [J + 7/3]. 7. sin [I + 7/2}. 

8. sin [1 — 1/3]. ~~ 9. 8 sin [172 + 2/2]. 


10. 2 sin [2] — π76]. 


11. We wish to construct a sine curve that crosses the z-axis at x = 7/4, 
has a period of 7/2, and has a maximum of 10. What is its equation? 

12. Draw a sketch of tan. (Suggestion: Since tan z is undefined when z is an 
integral multiple of 2, draw vertical lines at such points on the z-axis. Make 
use of the identities of Section 4-12.) What is the period of tan? 

13. Sketch tan*. 

14. Using the criteria of the section, define “Tan” in such a way that Tan* 
is a function. Define “Arctan” and sketch Arctan. 


15. ArcsinO = ? 16. Arcsin] = ἢ 

17. Arcsin 2 = ? 18. Arcsin (—1) = ? 

19. Aresin (1/2) = ? 20. Arcsin (0.89) = ? 

21. Arecos (—1) = ἢ 22. ArccosQ = ? 

23. Arecos (0.89) = ? 24. Arctan (—1) = ? 

25. Aresin (—0.71) = ? 26. Arctan (5.8) = ? 

Prove 27 through 30. 

27. sin (Arcsin x) = 2. 28. sin (Arccos x) = V1 -- 2x2. 
29. cot (Arctan x) = 1/z. 30. sec (Arecos x) = 1/2. 
Find all solutions of Problems 31 through 39. 

91. snag = I. 32. 55 αὶ = 0.5. 

33. sinz = —0.3. 34. sin 2x = 1. 

35. sin 5z = 0.3. 36. sin?z = 1. 

37. sin?'a + 1 = 0. 38. sin?z + 2sinz — 3 = 0. 


39. 2cosa+ sin 225 = 0. 


*40. Show that all solutions of sin y = xz are given by y = Arcsin x + 2kr, 
where k is an integer. Define the set of images of x under sin*. 

*41. Treat cos* and tan* as we treated sin* in Problem 40. 

* 42. Show that the criteria by which we chose a definition for Sin, Cos, and 
Tan can be applied to cot but not to sec and csc. 

*43. If a body of mass m moves along a straight line so that its acceleration 
is always toward a fixed point and proportional to its distance from that point 
with a constant of proportionality equal to k, its position is given by z = 
A cos V k/mt. (Sears and Zemansky, University Physics, p. 195) What is A? 
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What is the period? What is the frequency? Define the same function as a 
composite of a linear function and sin. 

* 44. The phenomenon of wave motion is a familiar one. Examples are waves 
in water and the waves observed in a rope whose ends are moved periodically. 
Such waves appear to have a sinusoidal shape and to move (along the water or 
the rope, for example). Such a moving wave may be represented by an equation 
of the form y = A cos 2r(t/T — x/d), where ¢t is the time and z is the distance 
from some fixed point. (Sears and Zemansky, University Physics, p. 372) For 
fixed ἐ, the equation gives the shape of the moving wave at that instant. For 
fixed x, the equation gives the motion of a particle (of water or of the rope) 
at that point as a function of time. What is the period for ¢? for 2? With what 
speed is the wave moving and in which direction? Graph the equation for 
T = 1, = 0.5, = 1. Do the same for ἐ = 1.25. 

*%45. Show that 7/4 = 4 Arctan (1/5) — Arctan (1/239). 


ANSWERS TO EXERCISES 


(a) AT T SOA Ve 7Κ7{1- Τὴ = f(z). (b) Dates, the days of the 
week, positions of point on a rotating wheel, repeated daily activities, etc. 
(0) 1 - 27) =fa@tTt+yT) = F(z@4+T) = f(z), ete. (ἃ) The defini- 
tion holds for any function for T = 0; permitting negative T adds nothing. 
(6) 1 year. (f) Dom sin = Re, Rgesin = (—1 1). (g) Because of the 
definition in terms of arcs on unit circles. (h) Immediate, since sinz/2 = 1 
and cos7/2 = 0. (i) Fora fixed x, cos x has the same value as sin (x + 7/2), 
ie., the value that sin x has +/2 toward the right. (j) Looks like Fig. 5-45 
with y-axis shifted 1/2 to right; 27. 

(k) Like Fig. 5-45 but with high and low points at (7/2, 2), (3/2, —2), ete. 
(1) Like Fig. 5-45 but with zeros as 1/4, 7/2, 34/4, π, etc.; curve is compressed 
horizontally. (τὼ) Fig. 5-45 with curve expanded horizontally, zeros twice as 
far apart. (n) Fig. 5-45 moved 7/3 to the left. (0) Use Section 5-10. (p) Por- 
tion of Fig. 5-45 between (—2/2, —1) and (4/2, 1); domain is (—r/2_ 2/2), 
range is (-Ξ1 1). (q) See Fig. 5-47. (r) See Fig. 5-47; domain is (—1 1), 
range is (0 πὴ. (s) Let domain be (0 2/2) U (8a/2 2m). (0) Domain 
(1/2 3/2). (ἃ) (--π 0). (v) Start at x=0 and go into the first quadrant; 
the inverse must be a function. 


(=) T) 


Areeos 


Figure 5-47 
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ANSWERS TO PROBLEMS 


1. Letting z = 0, we have sin 7 = sin 0 for any period 7. The only value 
of T that satisfies this and that is smaller than 27 is 7. But 7 is not a period, 
since sin (7/2-+ 7) = —1 #1 =sin7/2. 3. 1,7. 5.1,2x. Shift is 7/6 
to left. 7. 1, 27. Same as cos. 9. 3, άπ. 11. y = 10sin (42 — - π)ὴ. 13. See 
Fig. 5-48. 15. 0. 17. Undefined. 19. 7/4. 21. π. 23. 0.48. 25. —0.78. 

27. (5-10-9). 29. tan (Arctan x) = z, but coty = l/tany. 31. {x|4kke. 
J Na = π,2-Ὁ Qkr}. 33. {x|aKRET A [x = -- 0.8 -Η 2ῖπν x = —2.844+ 
2kr]}. 35. {|3Κ EST A [x = 0.06 + 2kr/5 V x = 0.57 + 2kr/5]}. 
37. Nullset. 39. {27j/akk EJ A [x = kw V x = 34/2 -+ 2π]}. 


FIGURE 5-48 


5-12 Arithmetic operations on functions. If f and g are functions 
whose ranges are subsets of the real numbers, we define the sum of f and 
g, f +g, as the function that maps z into f(x) + g(x); that is, 


(1) Def. f+g= t@yly = f@ + 9@)}. 


From this definition we have immediately (f -+- g)(x) = f(x) + g(a). 


Complete the following: (a) The image of z under f + g is the 
of the images of x under —___- and_ under —_____.. (b) ftg = 
hah(e)=——___. __ (e) Dom(f+g) = ; 


In Fig. 5-49, we graph f, g, and f + g where f = 0.57 + 1 and g = 
--1 - 2. Note how we find f(z) + g(x) for a given x by going to (2, f(z)) 
and then a vertical directed distance g(x). Since these vectors can be 
measured on the graph with a ruler or straight edge and compasses, once 
f and g are sketched, f + g can be drawn without further calculations. 
For a given ὦ, we find f(x), g(x), and f(x) + g(x) from the drawing. 
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Ficure 5-49 Figure 5-50 


Complete the following: (d) The graph of f meets the graph of f-+ g at 
the values of x where _______. (6) The graph of g meets the graph of f+ g 
where ____.. (ἢ At values of x where the graphs of f and g cross, 
((--- σα) Ξ.---  Α Ὀ, (g) The graph of f+ g will be below that of f where 
______, (h) The graph of f+ g crosses the x-axis where _______.._ (i) Sketch 
fog in Fig. 5-50. 


The above procedure of graphing f + g is called addition of ordinates. 
(The y-coordinate of a point is sometimes called its ordinate.) Addition of 
ordinates is particularly useful when the nature of the graph of a sum is 
not obvious. For example, Fig. 5-51 shows the graph of y = sin 7, 
y = sin2z, and y = sm2x-+sin2zx. The graph of the sum is found 
rather quickly from the drawing by addition of ordinates at key 
points. 


Graph by addition of ordinates: (j) sin-+ J, (k) I -+ sg (see Section 5-7). 


sin + sin ΟΝ 


Fiaure 5-51 
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We define the other arithmetic operations on functions as follows: 


(2) Def. —f = {@y)ly = —f@)}. 
(3) Def. fg = τ (- 9). 

(4) Ῥεῖ. f-g = (ἰῷ, ψν}ιν = f@)9@)}. 
(5) Def. 1/f = {@, y) ly = I/f(a)}.. 
(6) Def. f/g = f+ (0). 

(7) Def. jr = {(a, ly = σ))"} 


(1) Suggest a quick way to graph —f from a sketch of f, and sketch —f and 
—g where f and g are defined by Fig. 5-50. (m) Complete: (—f)(z) = 


—______; (f — g)(v) = ——___;; (fg)(z) = (note that the dot is 
omitted when no confusion is likely); (1/f)(x) = ——____ (f/g) (x) = ————_; 
(f")(z) = _____. _ (n) Graph f — g in Fig. 5-50. Suggest two ways to 


graph f — g from the graphs of f and g. (0) What are the domains of —f, 
1/f, and 5532 (p) What are the domains of f — g, f-g, and f/g? 


From the definitions it is easy to show that functions follow laws sim- 
ilar to those for numbers. For example, f +g = g +f is true for any 
functions, since ΝΕ 


ftg={@Wly=f@t+o@} τξ. ((, ψ}ὶ]ν Ξξ φ() -Ὁ 7(.)} ξε ρ -Ἐ ἢ, 


the second equality being justified by the commutative law of addition. 
Indeed, since every operation on functions is defined by the “same” 
operation on the images, each law for numbers has a counterpart for 
functions which can be proved by applying the corresponding law for 
numbers to the images. Indeed, if we have a set of functions closed under 
addition, negation, and multiplication, axioms (1-16-1) through (1-16-9) 
are satisfied, where the constant functions 0 and 1 play the roles of the 
numbers 0 and 1. Complications arise for the reciprocal since the domain 
of 1/f does not contain the zeros of f. Hence (1-16-10) does not hold as 
it stands. (Incidentally, we do not use 77 as a synonym for 1/f since 
[ΠῚ is sometimes used to stand for the inverse f*.) It follows that we 
can manipulate the names of functions just as we would symbols standing 
for numbers, provided we take into account the domains of all functions 
involved and keep in mind that equality of functions means identical 
range and domain as well as defining formula. By taking advantage of 
this, we can state many identities without the explicit use of variables. 
For example, sin? + cos? = 1 90 Vx sin?x -+ cos? = 1. 
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(q) Complete the following: sin/cos = —____; (12 — 1)/(I+ 1) = 
-1+ cot? = _______; 2sincos = ______._ (Note that we some- 


times use a numeral without the bar under it to stand for a constant function 
where no confusion is likely.) 


Since I(x) = xz, we can construct familiar functions by arithmetic 
operations applied to J and constant functions. In particular, we have 
(al)(x) = az, 15 (4) = x", al”(x) = ax”, and (1, (4) = 1/2. If we 
form an expression from “J” and symbols standing for constant functions 
by indicating arithmetic operations, we have a name of a function whose 
defining formula is obtained by changing “x” to “J” and changing names 
of constant functions to names of the corresponding numbers. We have 
already used this idea in naming linear and quadratic functions. Thus a 
defining formula for AJ? + BI 4+ C is “Az? + Br +C,” since (AI? + 
BI + C)(x) = (AI®)(x) + (Bi)(x) + C(x) = Az? + Br +C. 


(r) Find a defining formula for: 13, 17.132, (2-+ DJ‘, 2sin [817]. 


A function obtained from J and constant functions by the arithmetic 
operations exclusive of division is called a polynomial. A polynomial 
A,l” + Anyi” 1 +...4+ Aol? + Ail + Ao with A, ¥ Ὁ is called 
a polynomial of the nth degree. A function that is the quotient of two 
polynomials is called a rational function. 


(s) Write a typical defining formula for a polynomial; for a rational function. 
(t) Sketch 13 — 1 by addition of ordinates. (u) Sketch the same function 
by drawing J and 12 — 1 and noting that 15 — I = 1(12 — 1). (wv) Sketch 
1Χ( -- τ. 


It 1s not true that we get a defining formula for a function by sub- 
stituting an independent variable for “J” in any name of the function. For 
example, “sin + x?” is not a defining formula for sin + 12: However, 
sin + 12 = sin[Z] + 12, and “sina + 2?” is a defining formula obtained 
from the second name by (/:x). Evidently, if we have a defining formula 
for a function we can always construct a name for it by putting “7” in place 
of the independent variable. For example, we did this when we adopted 
the name “|J|” for the function defined by “y = |z|.” Also, given a name 
of a function, we can always insert “J” by using the fact that F = F[J], so 
that the resulting name will yield a defining formula by the substitution 
(1:2). For example, sin{[Z] = sin. Note that the brackets, indicating 
composition, are required here, since “sin J” might be read as “sin: J.” 
When possible, we shall usually follow the practice of naming functions so 
that the substitution (1:2) yields a defining formula. Of course this may 
not always be convenient, for example when different defining formulas 
are required in different subsets of the domain. 
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(w) Write a name for function defined by each of the following: y = 2/z; 


yy =a A y = 0;23 — y = 0;y = the largest integer less than or equal to z. 


This method of naming functions has a considerable advantage for 
the purpose of quickly finding composites. For any functions F and G, 
F = ΡΠ), G = [1], and G[F] = G[F{J]]. Defining formulas are F(z), 
G(x), and G(F(x)). [See (5-9-4) and (5-9-5).] Note that a name for the 
composite is obtained by substituting F(/) for J in G(J), just as a defining 
formula for the composite is obtained by substituting F(x) for x in G(z). 
So long as we use names obtained from defining formulas by (x:/), we may 
as well work with the names directly instead of going to the defining for- 
mulas, finding a defining formula of the composite, then going back to the 
names. For example, in Exercise (c) of Section 5-9 we found (27 + 1){I?] 
by noting that (27 + 1)(x) = 2x4 1, I?(x) = x?, then using (5-9-4’) 
to get ((27 + 1)[Z7})(z) = 227 + 1. Finally, we concluded that (27 +1) 
[13] = 27? + 1. This final result could be obtained immediately by sub- 
stituting J? for J in 27 + 1. 


(x) Do Exercises (d) and (f) in Section 5-9 by the above method. (y) Com- 
plete: 11 — 7] = _____;_ I??[sin [Z]] = —____;_ [*[sin] = —_____. 


We see from the last example that a name for G[F] may be found by 
substituting a name for F in place of J in a name for G, provided only 
that the name for G is such that a defining formula of G is obtained by the 
substitution (7:2). | 


(z) Is “I” a variable? Is substitution for “J” justified by the Rule of Substi- 
tution? What is the justification for the device discussed above? 


PROBLEMS 


1. How would you construct a table of f+ g, —f, f — g, fg, 1/9, f/g, and f” 
from tables of f and g? (Such procedures are used to construct tables of com- 
plicated functions from those of simpler ones.) 

2. Assuming that you can build machines that perform arithmetic opera- 
tions on numbers, draw pictures of machines along the lines of Fig. 5-13 to 
illustrate the definitions of this section. 

3. Into what does J? -+- I — 3 map τῇ 

4, What is the transform of —3 under f -ἰ- 2g? 

5. What corresponds to —1 under Arcsin + [29 

6. Into what does (1 — J)* transform 3? 


In Problems 7 through 12 sketch ἢ, g, —f, 1/f, f+ 9,f—g9, 7. σ, f/g, and 
gif]. Name and give a defining formula of each function. 

7. =~ 2,9 = F. 8, [͵ = Bg = —I. 

9. f = sin,g = 1. 10. f = sin’, g = cos*. 

11. f = sin,g = sin [31]. 7 12. f = sin, g = sin [4J]. 
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*13. Simple musical sounds are representable by functions of the type sin [1], 
where n © J and the pitch is determined by n. Show that the sum of any two 
such functions is periodic, and find the period. 

14. Show that Asinz-+ Bcosx = Csin(x-+ D), where C = ν A? + B? 
and D satisfies cos D = A/V A? + B? A sin ἢ = B/V A? + B?. Illustrate 
your result by graphing sin -+ cos by addition of ordinates and comparing it 
with the graph of 2 sin [1 -++ 2/4]. 


In Problems 15 through 18 repeat Problem 14 for the given sum. 
15. ν ὃ sin + cos. 16. 3sin + 4 cos. 
17. sin — 2 cos. 18. 0.3 sin + 0.7 cos. 


In Problems 19 through 24 find the composite Fo G, that is, G[F], directly 
from the names by substituting the name of F in the name of G in place of 1. 
Find it also by using the defining formulas, and compare. 


19. F = 277—1,G=1— 15. 20. F = 3sin,G = 5812, 
21. F = 517,G = 3 Sin. 22. F = Sin, G = [1]. 
23. F = Arccos, G = Sin. 24. F = sg, G@ = Arcsin. 


25. Prove that f/f = 1 is not a law for functions, but that 
/f=1N{awwWlfe) =0A y = 1} isa law. 
*26.f = {(z,y)|y = ΙΕ ΞΞῚ ΚΞ ἀξ δ᾽ Ξ [ἀρ  Ξ 272A 
1< 5 “ 2}. Find and sketch f + σ, 7" σ, g/f, and 56]. 
* 27, Sketch sg + (I — 12), sg- (J — I*), and sg [J — 12]. 
*28. f(z) = 2? — 1, (f+ 9)6) = (© + 1)?; g(x) = ὃ 
*%29. Solve for F: F* —JI = 0. 
*30. Suggest a definition of Vf, where f is a function. What is its domain? 


ANSWERS TO EXERCISES 


(a) sum; f;g. (Ὁ) f(x) + g(x). (ὦ Dom (f) MN Dom (σ). (d) g(x) = 0, 
l.e., at the zeros of g. (6) f(z) = 0. (ἢ 2f(z). (g) g(x) «0. (hb) f(x) = 
—g(x). (Ὁ Three segments through dots in Fig. 5-50. (1) A wavy curve 


ip 


I+ sg 


Figure 5-52 
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oscillating about the line y = 2, varying from a distance of 1 vertically above 
to a distance of 1 vertically below; not the same as just swinging whole sine curve 
around the origin by an angle of 7/4. (k) See Fig. 5-52. (Ὁ “Reflect” 
curve in z-axis; in Fig. 5-50 reflect corners and connect. 

(m) —f(x); f(z) — g(x); flx)g(x); I/F(x); 1) σα); ([(}}. (ἡ Graph f 
and —g and add ordinates; graph f and g and subtract g(x) from f(x) at key 
points from graph. (0) Dom (—f) = Dom (5) = Dom (ἢ); Dom (1/f) = 
Dom (f) M (17) = 0).  (p) Dom (f — g) = Dom (fg) = Dom (f) ἢ 
Dom (9); Dom (f/g) = Dom (Ὁ) M Dom (1.9). (4) tan; (J — 1) ἢ {(—1, 
—2)}’; ese?; sin [2/7]. (r) «ὃ; 1,2; (a + 2)a?; 2sin 8.3, (s) A,w™ + 
Agate - τις; CARE Agere -Ἐ PA) Cat + Baie ὶ-τ-Ὲ 
+++ + Bo). 

(t) Crosses x-axis at (0, 0), (1,0), (—1,0). (v) Domain does not contain 1; 
curve does not cross line x = 1. (w) 2/2; VI; 13; [J]. 6) G — D3; 
sin?; sin?. (2) No. No. It works! It does so because we use names for func- 
tions so that it does work! 


ANSWERS TO PROBLEMS 


1. Include columns for f(x), g(x), f(x) + g(a), ete. 8. 22+ 2 — 3.5. --π7 -Ἐ 
1. 7. Defining formulas: 2, «2, —2, 1/2, 2 - 27, 2 — x?, 2.2, 2/n?, 4. 9. sin z, 
x, —sina, l/sinz, x-+sinz, sinz — 2, xsinz, (sinz)/xz, sinz. 11. sing, 
sin 32, —sinz, 1/sinz, sin z-+ sin 82, sinx — sin 3z, (sin x)(sin 81), (sin x)/ 
(sin 32), sin (8 sin 2). 15. 2sin (ὦ + 7/6). 17. ν ὃ βίῃ (x — 1.11). 25. 1 is the 
set of all ordered pairs of real numbers whose second component is 1, but f/f 
is the set of all such pairs except those pairs whose first component is a zero of f. 


5-13 The conic sections. Mathematicians of ancient Greece obtained 
a very interesting class of curves by imagining a plane cutting a right 
circular cone. As suggested in Fig. 5-53, a right circular cone is generated 
by choosing a point V (called the vertex) on a line (called the azzs) per- 
pendicular to the plane of a circle at its center, and then letting a line 
through the vertex move around the circumference of the circle. Any 
plane perpendicular to the axis meets the cone in a circle. If the plane is 
rotated, the intersection first becomes a closed curve called an ellipse. 
If the rotation continues until the plane is parallel to a straight line on 
the cone, the curve of intersection 1s called a parabola. 


(a) Prove indirectly that a parabola is not a closed curve. 


If the plane is rotated past the position for a parabola, it meets both 
halves of the cone. The intersection is called a hyperbola and evidently 
has two branches. 


(b) What happens if we let the plane pass through the vertex? 
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Parabola 


Figure 5-53 


Though originally studied for their own sake, conic sections have been 
found to have many important applications, for they appear as paths of 
projectiles, orbits of planets, shapes of mirrors and lenses, patterns of 
points in statistical scatter diagrams, and architectural designs. 3 

By using theorems of plane and solid Euclidean geometry, mathe- 
maticians discovered many interesting properties of conics. However, 
the methods of analytic geometry yield the same results more easily. It 
turns out that the conic sections all have equations of the form 


(1) Az* + Bry + Cy? ++ Dr + Ey + F = 0, 
where ~(A = B=C= 0). 
Conversely, any equation of this form (except those equations that define 


two parallel lines) is an equation of a curve that can be obtained by letting 
a plane cut a right circular cone. (Proof may be found in some textbooks 


902 RELATIONS AND FUNCTIONS [cHAP. 5 


on analytic geometry, e.g., in that by W. A. Wilson and J. L. Tracey.) 
Since all the properties of conics can be derived from such equations with- 
out reference to their historical origin as sections of a cone, it is customary 
today to define a conic as a curve with an equation of the form (1). Ac- 
cordingly, by “conic” we mean a curve with an equation of the form (1). 
When the locus is the empty set, a single point, a single line, or two lines, 
it is called a degenerate conic. 


(c) From the definition show that a circle is a conic. (d) Write an equation 
of a single point in the form (1). How could we get a single point by letting 
a plane cut a cone? (6) Write an equation of the form (1) that defines the 
null set of points in the cartesian plane. (f) Write an equation of two. inter- 
secting lines in the form (1). (8) of two parallel lines. (h) Show that the 
graph of a quadratic function is a conic. What kind? (1) Are all conics graphs 
of functions? (j) Show that the converse of a relation whose graph is a conic 
is a relation whose graph is a conic. 


Some properties of conics are peculiar to one of the ellipse, parabola, or 
hyperbola. We consider these in the next three sections. Others are com- 
mon to all conics and will be introduced here. An obvious property that 
follows immediately from the definition is 


(2) A straight line meets a conic in at most two points. 


(k) Prove (2) by considering how one would find the intersection of (1) with 
a linear equation. (1) Find the points of intersection of 2.2 - zy + y? — 
z+ 4y —6 = Owithy — x = 1. 


A less obvious property is the following: 


A plane curve is a conic if and only 1} there ts a point (called 
the focus) and a line (called the directrix) and a number (called 
the eccentricity), such that for every point on the curve the ratio 
of the distance between that point and the focus divided by the 
distance between that point and the directrix 18. equal to the 
eccentricity. 


(3) 


Because of the “if and only if,” theorem (3) asserts two implications: 
a plane curve is a conic if it has the geometric property; a plane curve is a 
conic only if it has the geometric property. The first implication is equiv- 
alent to “if a curve has the geometric property, then it is a conic.” The 
second implication is equivalent to “if a plane curve is a conic, then it has 
the geometric property.” It is easy to prove the first implication. Suppose 
that a curve has the indicated property. Then by the Rule of Choice 
(2-10-9), let the eccentricity be e, the focus be (h,k), and the line 
be defined by Ayx-+ Byy + C, = 0. Then the locus is the set of 


5-13] THE CONIC SECTIONS 303 


points (xz, y) satisfying 


: [Aiz + Byy + Ci| 


Bey (emg ay alae ag 
ὑπ: Bi νῷ — h)? + (y — k) 


(4) 


(m) Justify (4). Show that e > 0. 


Now (4) is not in the form (1), but squaring both members yields an 
equivalent equation of the form (1), which proves that every curve with 
the geometric property is a conic. | 


(n) Find an equation in the form (1) of the locus of points such that the dis- 
tance from (2, 3) is twice the distance from x-+ y — 1 = 0. (0) Justify the 
last step in the above proof by carrying out and explaining the algebra. 


The proof of the second part of (4) is not so easy. We shall accomplish 
it by showing in Sections 5-14 through 5-16 that it holds for the special 
cases of ellipse, parabola, and hyperbola, and then showing in Section 
5-17 that every conic is one of these or a degenerate case. We shall find 
that the second part of the theorem does not hold for the circle and de- 
generate cases if we interpret it literally. However, by considering these 
special conics as “limiting cases,” we squeeze them into a liberal inter- 
pretation of the theorem. 

It is possible to tell from the equation of a conic whether it is an ellipse, 
parabola, or hyperbola. Indeed, we have 


A conic whose equation ts in the form (1) is an ellipse, parabola, 
(5) or hyperbola according as Β΄ — 4AC is less than, equal to, or 
greater than 0. 


The type of curve is determined also by the eccentricity according to 
the following: 


(6) A conic is an ellipse, parabola, or hyperbola according as the 
eccentricity 1s less than, equal to, or greater than one. | 


These theorems are proved in the following sections. 


PROBLEMS 


1. Show that (1) is the equation of a circle if B = Ο and A = C. 

2. Show that any circle has an equation of the form (1) with B = 0 and 
A = (C, 

3. Show that B? — 4AC < 0 for a circle, in conformity with (5). 
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4. Calculate B? — 4AC for the quadratic function defined by y = ax? + 
bx ++ c after putting the equation in the form (1). Does your result conform to 
(5)? 


Graph Problems 5 through 10, show that they are conics, and identify the 
type from the graph. 


δ. ay = 2. 6. 227 -+ y? = 1. 
7. 22 — y? = 2.. 8. 227 — νυ = 0. 
9, x7 + Qry+ y? = 0. 10. (2x + ὃν — 4)(x — 5y+ 10) = 


Find the members of the sets defined by the seats decks equations in 
Problems 11 and 12. 


ll. zy =liaty - 4. 
12. 17+ 4ἀγῳ — 3y7+ 2. — ὄν +7 =  0Ϊ;5 — 2y — ὃ = 0. 


For Problems 13 through 16 find the equation in the form (1) of the conic 
with the indicated eccentricity, focus, and directrix. Graph. 


13. 1, (4,0), « = —1. 14. 0.5, (1,0), 2 = —1. 
15. 2, (1,0), 2 = —1. 16. 0, (1,0), = —1. 


17. Verify that your results in Problems 5 through 10 conform to (5). 
18. Verify that your results in Problems 13 through 16 conform to (6). 


FIGURE 5-54 


19. The equation of a conic takes a very simple form in polar coordinates if 
the focus is placed at the origin, and the directrix parallel to one of the axes. 
Figure 5-54 shows a focus at (0, 0) and a directrix parallel to the y-axis through 
the point (k, 7), with k > 0. Show that an equation of the conic with 
eccentricity 6 is 


(7) y = ke/(1 — e cos 8). 


Graph the conics in 20 through 25 by plotting selected points. Sketch the 
directrix in each case, and indicate the eccentricity. 


20. r = 1/(1 — cos 6). 21. r = 27( — 0.5 cos 8). 
22. r = 2/(1 — 2 cos 8). 23..r = 3/(38 — cos 8). 
24. r(2 — 5cos 6) = 0. 25. τί! -Ἐ cos @) = 0. 
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*26. Find the form of the equation if the directrix is parallel to the y-axis 
but to the right, parallel to the z-axis and above, and parallel to the z-axis 
and below. 

* 27. Construct and graph examples illustrating your results in Problem 26. 


ANSWERS TO EXERCISES 


(a) If it were closed it would have to meet the line in the cone to which the 
cutting plane is parallel. (0) Two lines, one line (plane tangent to cone), or 
a point. (c) See Section 4-8. (4) (« — h)? + (y — k)? = 0 defines 
{(h, k)}. (e) a? -+ y? = —1. (ὃ 2? — 4y? = 0. (g) Expand (+ y+ 1) 
(x-+y+2) = 0. See Problem 21 in Section 5-6. (8) In form (1) with 
(B:0, 6:0); parabola. (i) No; example: two intersecting straight lines. (7) In- 
terchanging variables does not change form (1). (k) Solving linear equation 
for one variable and substituting in (1) yields a quadratic, which has at most 
two roots. (1) (1/4)(—3 + νΊ8,1 + 18) and (1/4)(—3 — V13, 1 — V/18). 

(m) (4-7-10) and (4-3-14); we must have e > 0 since the other terms are 
non-negative. (n) 22+ τυ + y2 + 2y — 11 = 0. (ὁ) First step: 
e2(Ayaz + Bry + Ci)? = (A? + B%)((2 — hb)? + (y — k)?). Then let AZ+ 
B? = A? and carry on bravely. The equations are equivalent to (4) since 
a? = $2 <> |a| = [Ὁ] is alaw. [Prove it from (3-9-10) and (2-8-38)]. 


ANSWERS TO PROBLEMS 


1. See Section 4-8. 3. B = 0, A = C. Hence AC > 0 and —4AC < 0. 
5. Hyperbola similar to Fig. 5-5. 7. Hyperbola, Fig. 5-5, rotated through 
—45°, 9. Single line, y = —z. 11. (a, 1/a) where a = 24 V6. 13. y? — 
4¢ = 0. Parabola. 15. 412 — y2+ 10+ 3 = 0. Hyperbola. 17. 6 is ellipse; 
8 and 10 two lines (a degenerate hyperbola). 19. Equation before simplifying 
is r/(k + rcos 6) =e. 21. Ellipse. 23. Divide numerator and denominator 
by 3 to put in form (7). 


5-14 The ellipse. The equation 
(1) 7 +2 =1 


is in the form (5-13-1) with (A:1/a”, B:0, C:1/b?, D:0, E: 0, F: —1). 
We let a and b be positive. Solving for y and x we find 


(2) | y = £(b/a)Va? — 2?, 


(3) x= 4(a/b)Vb? — y?. 


Since we are considering only real values of the variables, (2) shows that 
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lee 


| 
! 
dy (x,y) (a, 0) | 
| 


Figure 5-55 


we must have a? — 2? > 0, that is, |z| < a. Similarly, from (3), [0] < Ὁ. 
This shows that the locus is entirely within or on the boundary of the 
rectangle defined by —b < y XbA --αᾶ “κα “ a. 


(a) Carry through the manipulations to obtain (2) and (8). (Ὁ) Justify the 
conclusions drawn from them. 


Substitution shows that the points (a, 0), (—a, 0), (0, 6), (0, —b) lie 
on the locus. They are called vertzces of the curve. By plotting a few more 
points, the reader can convince himself that the curve is shaped as in- 
dicated in Fig. 5-55. Equation (1) is called the standard form of the equa- 
tion of an ellipse. The segments joining (a, 0) to (—a, 0) and (0, ὃ) to 
(0, —b) are called the azes of the ellipse, and their intersection is called 
the center. The longer one is called the major axis, the smaller one the 
minor axis. 


Graph the following by plotting numerous points: (0) 22/9 + y?/4 = 1, 
and (d) 27+ y?/4 = 1, (ὁ) Show that x = acost, y = bsin#é are para- 
metric equations of the ellipse defined by (1), i.e., that by letting ¢ take all real 
values we get precisely the points satisfying (1). 


It is convenient to use the concept of symmetry to describe the ellipse. 
We say that iwo points are symmetric with respect to a line if the line is the 
perpendicular bisector of the segment joining them. 


(f) We have shown that the points (2, y) and (y, x) are symmetric to the line 
defined by y = x Where? (g) Show that the points (2, y) and (—z, y) are 
symmetric with respect to the y-axis. (h) Show that the points (z, y) and 
(x, —y) are symmetric with respect to the z-axis. 


We say that a set of points 1s symmetric with respect to a line if, cor- 
responding to each point in the set and not on the line, there is another 
point in the set such that the pair of points is symmetric with respect to 
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the line. The line is called an axis of symmetry of the set of points. I*rom 
Fig. 5-55, it appears that an ellipse is symmetric with respect to each of 
its axes. To prove this we note that if (a, y) satisfies (1), then so do 
(—2, y) and (2, —y). The result follows from Exercises (g) and (h). 

We now show that (5-13-38) holds for the ellipse defined by (1). Tor 
this purpose we suppose that a > ὃ, and let ὁ = Va? — b?. Then we 
prove that (—c, 0) is a focus and that the line defined by x = —a?/c 
is the corresponding directrix of the curve. Letting the eccentricity be 
c/a, equation (5-13-—4) becomes 


(4) (c/a)|x + a?/el = V(x + 0)? + y?. 


This equation defines a set of points whose distances from (—ce, 0) and the 
line defined by x = —a*/c are in the ratio c/a. Squaring the equation, 
simplifying, and making use of the fact that a? — c? = b?, we find (1). 
Hence a focus of the ellipse defined by (1) is (—c, 0), the corresponding 
directrix is the line defined by x = —a?/c, and the eccentricity is c/a. 
Figure 5-55 shows the focus and directrix and the distances r; and dy 
for a typical point (2, y). 


(i) Carry through the algebra to show that (4) is equivalent to (1). (j) Show 
that (c, 0) and x = a?/c are an alternative focus and directrix. (k) Show that 
in conformity with (5-13-5), B? — 4,6 «0 in (1. (Ὁ Show that in con- 
formity with (5-13-6), the eccentricity of (1) is less than one. (m) What 
happens if a = b? What about the foci, the eccentricity, the directrices? 
(n) What is the relation between a, ὃ, andc if a « δ᾽ Where are the foci? the 
directrices? 


PROBLEMS 


In Problems 1 through 6 sketch the graph of the equation, indicate the 
vertices, the lengths of the major and minor axes, the foci, the directrices, and 
the domain and range of the relation defined by the equation. 


1, 22/44 y?/16 = 1. 2. 27/9 + y?2/25 = 1. 
3. 422 + Oy? = 36. 4, 10022 + y? = 100. 
δ, 1622 + y? = 4. 6. 427 + Oy? = 1. 


In Problems 7 through 12 sketch the ellipse determined by the given condi- 
tions, and write the equation of each one. 


7. Focus at (3, 0), vertex at (5, 0), center at origin. 
8. Center at origin, vertices at (1, 0) and (0, 2). 
9. Center at origin, focus at (3, 0), eccentricity 0.75. 
10. Vertices at (0, —3), (2, 0), (0, 3), (—2, 0). 
11. Focus at (—3, 0), corresponding directrix at x = —16/3. 
12. Eccentricity 0.5, major axis of length 10 and along y-axis, center at origin. 
% 13. Half the length of the major axis is sometimes called the semimajor azis. 
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The earth’s path is elliptical with the sun at one focus. Taking the semimajor 
axis as 92.8 million miles and the eccentricity as 0.1, write an equation of the 
path. | 

*14. Do the same for the path of Mercury, whose eccentricity is 0.2 and semi- 
major axis is 0.39 times that of the earth’s orbit. 

* 15. Why did we assume a and ὃ positive in (1)? 

* 16. Prove that (1) is an equation of the set of points the sum of whose dis- 
tances from (—c, 0) and (c, 0) is 2a, where c? = a? — b? anda > ὃ. (This 
is sometimes taken as a defining property of the ellipse.) 

*17. Formulate and prove a similar result for the case where a < ὃ. 

* 18. Show that the parametric equations x = 2asinicost, y = ὃ cos*t — 
ὃ sin*t define an ellipse. 

*19. Suggest a way of constructing an ellipse by using the property of 
Problem 16. 

*20. What can you say about the family of ellipses given by (a2 — ¢2)x? -+ 
a*y* = a*(a? — c?) for different values of a and c? 


ANSWERS TO EXERCISES 


(a) See Section 2-8. (c) Major axis horizontal. (d) Major axis vertical, 
vertices at (1,0), (—1, 0), (0, 2), (0, —2). (6) Substitute in (1) to show that 
any point given by them lies on locus; also, all values of ᾧ in (—a a) are 
given by = acost. [Compare the discussion of (4-5-1) and (4--5 -ὃ).} (ἢ) In 
Section 5-10. (6) The line joining the points is obviously perpendicular to 
the z-axis, and the points are equidistant from the z-axis. (8) Similarly. 

(i) Square and simplify. (1) Equation (4) is the same except that the first 
two plus signs are changed to minus signs. (Κ) Since here B = 0, and A 
and C are greater than zero. (1) c* = a? — b?, hence c < a, and c/a < 1. 
(m) A circle, zero eccentricity, foci at center, directrices “at infinity”: the circle 
is a-limiting case of an ellipse in which ¢ approaches zero. (n) c? = δ — a?, 
In all cases οἷ = ja? — 82]. The foci at (0, 6), (0, —c). The directrices are 
given by y = b?/c and y = —b?/c. 


ANSWERS TO PROBLEMS 


3. Divide both members by 36. 5. Divide both members by 4, then divide 
numerator and denominator of first term by 4. 7. x?/25-+ y?/16 = 1. 
9. 27/16 + y2/5 = 1. 11. 22716 -Ἑ y?2/5 = 1. 18. 2?/(92.8)2 + y?/(0.99) 
(92.8)? = 1. 15. For convenience, so that a and ὃ would be half the lengths of 
the axes. 17. Sum of distances from (0, c) and (0, —c) is 2b, where c? = b? — a?. 
Use distance formula to express distances. Set sum equal to 2a. Manipulate 
so that one square root is alone on one side and square. Simplify and repeat. 


5-15 The hyperbola. The standard form of the equation of a hyper- 


ἫΝ 
bola 15 ᾿ ᾿ 


oy y 
(1) | aes ee 


A discussion of this equation is naturally very similar to that of the 
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standard form of the equation of an ellipse in the previous section. Ac- 
cordingly we follow such a pattern, leaving it to the reader to fill in the 
details. 

Equation (1) is in the form (5-13-1) with 


(a) Complete the previous sentence and equations (2) and (3). 


As before we let a and ὃ be positive. Solving for y and x, we find 
(2) y= 
(3) 7 ΞΞ ΞῈ 


Since we are considering only real values of the variables, (2) shows that 
we must have x” — a? > 0, that is, 
(b) Complete the previous sentence. 


However, since y”? + b? > Ὁ for all real y, the range of y is the real 
numbers. 

Substitution shows that (a, 0) and (—a, 0) lie on the curve. We call 
these the vertices of the hyperbola. By plotting a few more points, the 
reader can convince himself that the curve is shaped as indicated in Fig. 
5-56. The segment joining the vertices 1s called the princzpal azis. Its 


| | 
x = —a*/c x = a*/e 


| y = bx/a 
| 
| 
| 


FigurRE 5-56 
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midpoint is called the center of the hyperbola. The perpendicular bisector 
of the principal axis is called the transverse axis. 


Graph the following by plotting numerous points: (ὁ) 27/9 — y?/4 = 1, 
and (4) 27 — y#/4 = 1. (ὁ) Show that x = asect, y = atan? are 
parametric equations of the hyperbola defined by (1). (ἢ) Show that (1) is 
symmetric with respect to the x-axis and with respect to the y-axis. 


Symmetry with respect to a line is called axial symmetry. Another 
type of symmetry, called central symmetry, is defined with respect to a 
point. Two points are said to be symmetric with respect to a point if the 
point is the midpoint of the segment joining them. 


(g) Show that the points (z, y) and (—2z, —y) are symmetric with respect to 
the origin. 


A figure is said to be symmetric with respect to a point called the center if, 
corresponding to each point of the figure not at the center, there is an- 
other point of the figure such that the pair of points is symmetric with 
respect to the center. Both the ellipse and the hyperbola appear to possess 
central symmetry. To prove this it is sufficient to observe that their 
equations are equivalent to the equations we get by the substitution 


(εχ ΣῊΝ 
(8) Why is this sufficient? 


We now show that (5-18-3) holds for the hyperbola defined by (1). 
Letting c = ν α + 62, we prove that (—c, 0) is a focus and that the line 
defined by x = —a?”/e is the corresponding directrix of the hyperbola. 
Letting the eccentricity be c/a, equation (5-13—4) becomes 


(4) (c/a)|x + a*/el = “(ὦ + ¢)* + y”*. 


This equation defines a set of points whose distances from (—c, 0) and the 
line defined by x = —a*/c are in the ratio c/a. Squaring the equation, 
simplifying, and making use of the fact that c? = a? -Ε b?, we find 
(1). Figure 5-56 shows the focus and directrix and the distances 7; and 
d, for the typical point (2, y). 


(i) Carry through the algebra to derive the above results. (7) Show that 
(c,0) and x = a2/c are an alternative focus and directrix. (k) Show that in 
conformity with (5-13-5), B* — 4AC > 0 in (1. (1) Show that in con- 
formity with (5-13-6), the eccentricity of (1) is greater than one. (m) Must 
the discussion be altered according to whether a < 6 or not? 


Tigure 5-56 shows two lines with equations y = bx/a and y = —ba/a. 
It appears that the hyperbola approaches these lines. 
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(n) Draw the lines on your graphs of Exercises (c) and (d). 


To show that the curves actually do approach these lines, we consider h, 
the directed distance parallel to the y-axis from the hyperbola to the line. 
We consider first the branch of the hyperbola in the first quadrant and the 
corresponding line there. We have 


(5) h = θα α — (b/a)Vx?2 — a? = (b/a)(x — Vx? — a?). 


Now we multiply and divide the right member by x -++ να — αὐ. The 
result is | 


(6) h = (b/a)(x? — («* — a?))/(@ + V2? — a?) 
(7) = (αδ)  - Va" — a’). 


(o) Why is this multiplication and division justified? (p) Explain (6) 
and (7). 


Now consider what happens to A when x gets very large. Certainly ἢ 
is positive, since both numerator and denominator are positive. Hence 
the hyperbola remains below the line. However, as x gets large, the de- 
nominator gets large without limit. Hence ἢ gets small. Indeed, by going 
out far enough (1.6., by taking x large enough) we can make ἢ as small as 
we like. We describe this by saying that the hyperbola approaches the 
line, and we called the line an asymptote of the curve. 


(q) Show that in the fourth quadrant the hyperbola approaches the line 
y = —b2z/a as x increases. 


A quick way to sketch a hyperbola is to draw the asymptotes and 
vertices, then draw a curve through each vertex approaching the asymp- 
totes. An easy way to remember the equations of the asymptotes is to 
note that they are obtained from the equation of the hyperbola by putting 
0 in place of 1 in the right member of (1), for we have 


ny? 
(8) a2 pe = 9 (Δα + y/b) (e/a — y/b) = 0 

(9) “Ὁ x/a + y/b = 0 V x/a — y/b = 0 
(10) —y = —br/a V y = da/a. 


Hence (8) is the equation of the union of the two asymptotes. 


(r) Justify (8) through (10). 
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PROBLEMS 


In Problems 1 through 6 sketch the graph, indicate the vertices, the foci, the 
directrices, the asymptotes, and the domain and range of the relation defined 
by the equation. 


1. 22/4 — y?/16 = 1. 2. 22/9 — y?/25 = 1. 
3. 4." — Qy? = 36. 4, 100x2 — y? = 100. 
5. 1627 — y* = 4. 6. 427 + Oy? = 1. 


In Problems 7 through 12 sketch the hyperbola determined by the given condi- 
tions and write equations for each one. 


7. Focus at (5, 0), vertex at (3, 0), center at (0, 0). 

8. Center at origin, vertex at (2,0), focus at (8, 0). 

9. Center at origin, focus at (3, 0), eccentricity 2. 
10. Center at origin, focus at (0, 6), vertex at (0, 4). 


11. Focus at (—3, 0), corresponding directrix atx = —1. 
12. Eccentricity 1.5, principal axis of length 10 along z-axis, center at origin. 
13. Graph «279 — y?/4 = —1. 


*%14. Discuss the relation defined by (1). What is its domain? its range? Ex- 
press it as the union of two functions. What is the appearance of its inverse? 
Illustrate by sketching a particular hyperbola and its inverse. 
* 15. Treat the ellipse as in Problem 14. 

16. Prove that (1) is an equation of the set of points the difference of whose 
distances from (—c, 0) and (c, 0) is 2a, where c? = a? + 6%. 
*17. Graph «2 — y? = 1 and compare with Fig. 5-5. 
#18. Prove that if a figure is symmetric with respect to two perpendicular lines, 
it is symmetric with respect to their intersection. 


ANSWERS TO EXERCISES 


(a) (A:1/a?, B:0, C:—1/b?, D:0, E:0, F:—1); (3) (b/a)\/z2 — a?; (4) 
(a/b)\/y2 +62, (Ὁ) |x] > a. (ὁ) Vertices at (3,0), (—3,0).  (d) Ver- 
tices at (1,0), (—1,0). (ὁ) Like Exercise (6) in Section 5-14. See (4-12-7). 
(ff) (1) «» (1γ(ψ:--ν) © (1)(@:—2). (6) The two triangles formed by the 
component vectors are congruent. (h) Since this shows that (x, y) lies on the 
graph if and only if (—2z, —y) does, and each such pair issymmetric. (k) Since 
here A > QO and C «0. (1) Since c? = αὐ + b?,c > α. (m) The treat- 
ment is the same regardless of the relation between a and ὁ. (n) y = +27/3 
in Exercise (c) and y = +2x in (d). (o) (1-14-2). (p) (1-12-11). 
(q) Here h < 0; but manipulations are similar. (r) (1-12-11), (2-8-8). 


ANSWERS TO PROBLEMS 


7. 22/9 — y2/16 = 1. 9. «2/9 — y2/27 = 1. 11. 3,8 — 92/6 = 1. 
13. Vertices (0, 2) and (0, —2). Asymptotes, y = +22/3. 
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5~16 The parabola. A parabola is a conic with eccentricity equal to 1. 
Let the focus be at (c, 0) and the directrix be defined by x = —c, where 
c > 0 asin Fig. 5-57. Then the condition that the ratio of the distance 
r from the focus to the distance d from the directrix be 1 is r/d = 1 or 
r=d. Since r= VW(a — c)? + y? and d= |x-+ cl, we have, after 


squaring and simplifying, 


(1) y? = Acz. 


The curve is easily sketched by substituting values of y. Indeed, (1) is 
equivalent to 2 = y”/4c, which defines the converse of the quadratic 
function 12,4. defined by y = 27/4e. 


Graph the following parabolas, indicating focus and directrix: (a) y? = 45, 
(b) y2 =z, (ὦ y*? — 7x = 0. 

(d) Express the relation defined by (1) as the union of two functions whose 
intersection is the origin. (6) Show that (1) is of the form (5-13-1) when it 
is written y2 — 4.5 = 0. | 2 | 


The graph in Fig. 5-57 looks very much like the graphs of the quadratic 
functions in Section 5-8. Is the graph of any quadratic function a parabola? 
To show that the answer to this question is affirmative, we first show that 
the equation of the graph of any quadratic function can be put in the 
form y = ax? by appropriately placing the axes. From Section 5-8 we 
know that the graph of y = αὐ - δχ - ας has vertex (—b/2a, 
(4ac — b”)/4a). We are going to show that if we move the axes so that 
the origin is at this vertex, the equation of the curve becomes y’ = az’?, 
where (z’, y’) are coordinates of points with respect to the new axes. 

Let us imagine, as suggested in Fig. 5-58, that the axes are moved with- 
out any rotation so that the coordinates of the new origin O’ with respect 
to the old origin O are (xo, yo), and the new axes are parallel to the old. 


FIGURE 5-57 FIGuRE 5-58 
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Consider a point A whose coordinates with respect to the old axes are 
(x, y) and with respect to the new axes are (x’, ν΄). These coordinates are 
components of the vectors OO’, OA, and O’A respectively, as indicated in 
the figure. Evidently 


(2) (x, y) πεῖ (Xo, Yo) =e Ce y’). 


(f) J ustify (2) by an argument not depending on a figure. 


Hence we have (a, y) = (z’ + 20, y’ + Yo), which is equivalent to 
each of the following two pairs of equations. 


(3) c= e+ toy = y+ Yo, 
_ (Equations of translation) 
(4) n= 2 — X%o,y' = ¥ — Yo- 


The equations give us the relationship between the coordinates of a fixed 
point with respect to two different axes. When the axes are moved with- 
out rotation, we speak of a translation. Equations (2) and (4) refer to a 
translation through the vector (Zo, Yo). 

Let us consider now the quadratic function 212 — 87 — 4 defined by 
y == 2x7 — 3x — 4 and the sketches in Fig. 5-33. In (5-8-2) we rewrote 
the defining formula as 2(z — 3/4)? — 41/8. Hence the same graph is 
defined by the equation y + 41/8 = 2(x — 3/4)?._ Now suppose we 
let χ' = x — 3/4, y' = y + 41/8. Then the equation becomes y’ = 
2x’*, But these equations are (4)(x9:3/4, yo: —41/8). Hence they cor- 
respond to a translation of the axes to the vertex (3/4, —41/8). We see 
that in relation to the new axes, the equation is y’ = 2.2. 


(g) Draw a sketch showing the curve and the two pairs of axes. 


More generally, for any quadratic function al? + bI -++ e defined by 
(5) y=—ax*+brte, 
we have from (5-8-3) the equivalent defining equation, 
(6) y — (4ac — b*)/4a = a(x — (—bd/2a))?. 


A translation to the point (—b/2a, (4ac — b*)/4a), that is, to the vertex 
of the curve, yields the new equation 


(7) ν' = ax”, 


as we wished to prove. 
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(8) Graph y = 312 — 4x - 2 by finding the vertex, translating the origin 
there, and graphing with respect to the new axes. 


We have shown that the graph of an equation of the form y = ax? + 
bx +c has an equation y’ = az’? with respect to new axes obtained by a 
translation. Now, interchanging x’ and y’ is equivalent to rotating the 
curve about the line y = 2, and such a rotation certainly does not change 
its shape or size in any way. But the interchange yields the equation 
a’ = ay’? or y’? = (1/a)x’, which is of the form (1) and hence is a parab- 
ola. Hence the graph of any quadratic function is a parabola. 


(i) Show that the graph of any equation of the form x = ay?+ ὃν τι ς, 
where a ¥ 0, is a parabola. Indicate its focus and directrix. (k) Find the 
focus and directrix of the graph of y = az?+ bz+c. () Consider the 
graph of (5-13-1) when A = B= D=Oo0rB=C=EH=0. (m) Verify 
(5-13-5) and (5—13-6) for parabolas. 


PROBLEMS 


In Problems 1 through 8 sketch the curve and find its focus and directrix. 


1. μὲ = 52. 2. Qy* = 81. 

3. y? = —Az. 4, y? = 4x — 2. 

5. y2 + ὃν = 4. 6. 2 = 2y? — ὃν - 7. 

7. 38y2 + 224 + ἄν — 5 = 0. 8. 27 — 2y+4 = 0. 

9. The axes are translated through the vector (2, —3). The old coordinates 


of a point are (—8, 3). What are its new coordinates? 

10. In the same situation, what are old coordinates of a point whose new 
coordinates are (0, —2)? 

11. Under a translation, the point whose old coordinates are (3, —7) has 
coordinates (—1, —6) in the new system. To what point was the origin 
translated? 

* 12. Sketch the graph of y = 32-+ 2. Translate the origin to its y-intercept, 
and find the equation with respect to the new axes. 


In Problems 13 through 16 find an equation of a conic with respect to focus 
and directrix placed so that the equations are not in standard form. Use 
(4-7-10) to find the equation and put it in the form (5-13-1). 

% 13. Eccentricity 1, focus (3, 2), directrix + y = 1. 
%*14. Eccentricity 2, focus (8, 2), directrix x + y = 1. 

* 15. Eccentricity 4, focus (—3, 1), directrix y = 0. 

% 16. Eccentricity 4, focus (—3, 1), directrix y = 42 +- 3. 


*17. Show that the equation of any circle can be put in the form x? + y? = r? 


by translating the origin appropriately. Illustrate by discussing Problem 1 in 
Section 4-8. 
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ANSWERS TO EXERCISES 


(a) (1,0),2 = —l. (Ὁ) (1/4,0), = —1/4. (ὦ (7/4,0), 2 = —7/4. 
(d) {(v, y)|y? = 4ex} = {(a,y) ly = 2Wez} UL(aWly = --ὅν ὦ}. 
(e) (4 Ὁ, B:0,C:1, D:—4c, Ε Ὁ, F:0). (f) Since the two sets of axes are 
parallel and have the same scales, the components of a vector are the same 
with respect to them. To arrive at a point A, we may go from the old origin via 
the vector (zx, y) or from the old to the new origin via the vector (xo, yo) then via 
the vector from the new origin to the point (2’, y’). Since the two paths have 
a common beginning and ending, we have (2) immediately regardless of the 
position of the axes or the point. (g) New axes with origin at (3/4, —41/8). 

(ἢ) Vertex is (2/3, 2/3). (i) Complete the square on y. Procedure 
is as befcre with (x:y, y:z). Focus: ((4ac — b*-+ 1)/4a, —b/2a), directrix: 
x = (4ac — b? — 1)/4a. (k) (—b/2a, (4ac — b2-+. 1)/4a) and y = 
(4ac — b? — 1)/4a. (1) In the first case the equation takes the form Cy? -+ 
Ey + F = 0, which is an equation of two lines parallel to the z-axis, one such 
line, or the empty set according as EH? — 4CF > 0, ΞΟ, or <0. 


ANSWERS TO PROBLEMS 


1. (5/4,0),2 = —5/4. 3. (—1,0), 2 = 1. (Note that the parabola is open 
to the left and the range of x is the nonpositive reals.) 5. (7/16, —3/2), 
x = —25/16. 7. (11/2, —2/3), x = 35/6. 9. (—10, 6). 11. (4, —1). 


18. (ety — D/V/2 = Vi — 3)2+ (y — 2)2, which reduces to «2 — 


Qry + y? — 10z — ον + 25 = 0. 


*5-17 The general case. In the previous section we showed by ap- 
propriate translations of the axes that an equation of the form 


(1) Ax? + Bry + Cy? + Dx + Fy + F = 0, 


with 
A= B= 0-4-6 <0 AD 50 


or with 
B=C=0AAAFODOAE #0, 


is an equation of a parabola. By similar methods we now show that (1) 
with B = 0 and neither A nor C zero is an equation of an ellipse or 
hyperbola. 

Suppose we translate the axes through the vector (h, k). Using (5-16-3) 
(to:h, yo:k) in 
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Q) Ax? + Cy? + Dx + Ey t+ F=0 
and simplifying, we find 


(3) Aa’? + Cy’? + (2Ah + D)x’ + (2Ck + E)y’ 
= —(Ah? + Ck? + Dh + Ek + F). 


Clearly, if we let h = —D/2A and k = —E/2C, the equation takes 
the form : 

(4) Ar’? + Cy’? = K, 

where Καὶ = —(Ah? + Ck? + Dh + Ek + F). Hence, by an appropriate 


translation we can eliminate the linear terms in an equation of the form (2). 


(a) Carry through the substitution to reduce 2x7 + 5y?-+ 22 — ὃν — 10 = 0 
to the form (4). Graph the result in relation to the new axes. (b) Suggest a 
method of finding K that is easy to remember. 


Assuming that Καὶ σέ 0, we may divide both members of (4) by it and 
rewrite (4) in the form 


(5) τσ τοι. 


We may assume A > 0, since if it is not we could multiply both members 
of (4) by —1. If C > Oalso and Καὶ > 0, then K/A > Oand K/C > 0. 
Hence there are real numbers a and ὃ such that K/A = a? and K/C = b?, 
and (5) takes the standard form of the equation of an ellipse. IfC > Oand 
K «0, then K/A and K/C are negative, and the locus contains no 
points. In this case we call the locus an imaginary ellipse. Now ifC < 0 
and Καὶ > 0, we have K/A = a* and K/C = —b?, and (5) reduces to the 
standard form of the equation of a hyperbola. If C < 0 and K < 0, 
K/A = --α and K/C = b?. An interchange of x and y then puts the 
equation in the standard form for the hyperbola. Finally, when Καὶ = 0, 
(4) 1s an equation of the single point given by 2’ = y’ = Oif A/C > 0, 
or of a pair of straight lines through the origin in A/C < 0. 


(c) Justify the last statement. 


We have now considered every case for (1) with ~(A = B= C = 0) 
for which B = 0. We found that the equation defined a parabola, ellipse, 
or hyperbola or else the null set, a single point, a line, or a pair of lines. 
We refer to these last four cases as degenerate conics. With this under- 
standing, we can say that (1) with B = 0 and ~(A = C' = 0) always 
defines a conic. 
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Figure 5-59 


To complete our proof that any equation of the form (1) with 
~(A = B=C = 0) defines a conic, we must consider the case in which 
B =~ 0. We shall show that the xy-term may always be eliminated by an 
appropriate rotation of the axes about the origin. Before we can do this, 
we need the equations giving the relation between the old and new co- 
ordinates of a point when the axes are rotated. 

In Fig. 5-59 we show the point A, the original axes, the new axes, and 
the angle ¢ (Greek “phi”) of rotation. In polar coordinates the point A is 
given by (r, θ)» with respect to the new axes and by (r, 8+ φ), with 
respect to the old axes. According to (4-11-2), we have x/r = cos (θ + 9), 
y/r = sin (0 + ¢), 2’/r = cos 0, y’/r = sin 6. Now we apply (4-12-10) 
to the expression for x/r to get 


(6) | x/r = cos @cos¢ — sin @sin¢ 


| 


(7) (x'/r) cos @ — (y’/r) sin φ. 


Multiplying both members by r we find the first formula of (8) giving the 
old coordinates in terms of the new. 


x = x'cos¢ — y’sin ¢, 
(8) (Equations of rotation) 
y = «sing + y’cos ¢. 


(d) Derive the second equation in (8) by considering the formula for y/r. 


If in (1) we replace x and y by their synonyms from (8), we find © 
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(9) (A cos?¢ + B sin ¢ cos φ + C sin2$)zx’2 
+ ((C — A) sin 26 + B cos 24)" γ' 
+ (A sin’¢ — Bsin d cos ¢ + C cos’) y’” 
+ (D cos ¢ + Ε βίη ¢)2’ 
+ (—Dsin ¢ + E cos φ)ν' + F = 0. 


(e) Verify (9) by making the replacement and simplifying. 


Evidently we can eliminate the term involving the product 2’y’ if we 
can so choose ¢ that (C — A) sin 2¢ + Bcos2¢ = 0. But this is equiva- 
lent to cot 26 = (A — C)/B. Since the range of cot is the real numbers, 
there is a value of ¢@ such that this equation is satisfied, and for this rota- 
tion, equation (1) reduces to the form (2), of which every case has already 
been considered. Hence we have proved that every equation of the form 
(1) with ~(A = B = C = O) defines a conic. 


(f) Carry through the rotation to eliminate the xy-term for the equation 
xy = 1. Graph it in relation to the old and new axes. 


PROBLEMS 


The derivation of equation (4) shows that when A ¥ 0 A C = 0 the center 
of the graph of (2) is at (—D/2A, —E/2C) and that the equation with respect 
to axes translated to the center is (4). Use this procedure to graph 1 through 8, 
showing both old and new axes. 


. 827 + 8y2 + Br — 12y — 2 = 0. 

. 16x? + 23 — 322+ 16y +16 = 0. 
a* — 4y? — 6x — l6y — 11 = 0. 
4x7 — Qy?-+ 8x-+ 72y — 264 = 0. 
227 + y? — 82+ 6y + 17 = 
2x? +- y? — 84+ 6y + 20 = 0. 
227 — ἡ — 8x — by — 1 = 0. 
227 — y? — 8x — by +1 = 0. 


Ὁ ONDA wR wwe 


. An alternative way of deriving (4) and of simplifying an equation in the 
form (2) in any particular case is to complete the square on the terms in each 
variable separately, as we did for one variable in Sections 5-16 and 4-8. Deal 
with Problems 1 through 8 in this way. 

* 10. Derive formulas giving the new coordinates in terms of the old after 
rotating the axes through an angle ¢. Do this in two ways: first by solving (8) 
for x’ and y’, second by relabeling Fig. 5-59 so that 6 is the angle of A in the 
old system and @ — ¢ is its angle in the new. 
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%11. Find approximately the angle ¢ to eliminate the zy-term in 52? — 4ay + 
22 = 0. Find sin@¢ and cos ¢ exactly. 

*12. Letting A’, B’, C’ be the coefficients of 2’2, χ' ψ', and y’2, respectively, 
show that A’ + C! = A+ C and B’? — 4A’C’ = Β5 — 4AC. 

*13. The previous problem shows that rotation does not change the value of 
the expression formed by squaring the coefficient of zy and subtracting four 
times the product of the coefficients of x? and y*. Show that this expression is 
not changed by a translation. 

*14. Explain how the results of Problems 12 and 13 suffice to secant 
(5-13-5). 

*15. By rewriting (5-13-7) in rectangular coordinates, prove (5-13-6). 


ANSWERS TO EXERCISES 


(4) 1 = 2 —1, = y+ 4/5, 2.2.2 + By’? = 66/5, g’?/(33/5) +. 
y’?2/(66/25) = 1, a = 26. b= 1.7. (Ὁ) Substitute the coordinates of the 
new origin in the left member and take the negative of the result. (ὁ) If 
A/C > 0, C > Ὁ, and we have Az’? + Cy’? = 0, which is satisfied only for 
χ' = y' = 0. However, if A/C < 0, that is, C < 0, then we have Ag’? — 
(—C)y’2 = 0, whose left member factors to give (WAa’ + /—Cy’) =0V 
(V Az’ Bf Cy) = 0. (da) y/r = sin (0+ φ) = sin @cos@-+ cos sing = 
(ψ' 77) cos@ + (2'/r) sing. (6) We need to use (4-12-22) and (4-12-23). 
(f) Result is 2’? — y/? = 2. 


ANSWERS TO PROBLEMS 


1. Center: (—1, 2); new equation: 2’? + y’? = 17/3. 3. (8, —2), 2’*/4 — 
y2=1. 5. (2, --8), 2.2 -ἰ y’2 = 0, point ellipse. 7. y'? — 2.2 = 0. 
9. For example, after completing the square in Problem 1, we find 3(% -- 1)? + 
3(y — 2)2 = 17. Then for x’ = «+ 1, we have zo = —1 from (5-16-4). 


CHAPTER 6 
NUMBERS 


*6-1 What are numbers? Everyone knows what numbers are. A 
person begins to learn about them almost as soon as he can speak, has 
many experiences with them as he grows older, and seldom passes a day 
of his life without using them in some way. Our store of knowledge about 
them is so vast and so rapidly growing that only a few specialists have a 
clear picture of it. Yet the sentence with which this paragraph begins is 
hardly true if it is taken to mean that everyone can say precisely what 
numbers are. 

It would be more accurate to say that everyone knows something about 
numbers. Similarly, everyone knows something about electricity, and 
some specialists know a great deal about it. But who knows what elec- 
tricity “really is”? If we attempt to define numbers (or electricity) we 
must do so in terms of other undefined terms, as pointed out in Section 
1-13. Then we may formulate the properties of numbers by means of 
axioms and other laws. Such a theory comes as close as is possible to 
telling what numbers “really are.” 

There are many satisfactory theories describing our number system. 
Although they differ in their undefined terms, their definitions, and their 
axioms, they exhibit the same laws, for the obvious reason that they all 
have been designed to formalize the properties of numbers as they are 
known and used. 

The purposes of this chapter are (1) to suggest a way in which cardinal 
numbers (including the natural numbers and the transfinite numbers) 
can be defined in terms of sets, (2) to familiarize the student with the basic 
properties of the natural numbers, particularly the principle of finite in- 
duction, (3) to outline the way in which the real and complex number 
systems can be built upon a few axioms about the natural numbers, and 
(4) to give the student further insight into the nature of numbers. 

Some of the sections of this chapter are marked as optional because, in 
spite of their considerable mathematical interest, they are not essential 
to later chapters. The treatment is mostly informal, proofs and many 
details being omitted where only a bird’s-eye view of the ideas is given. 


*6-2 The cardinal numbers. The Mohave indians of California used 
to ally themselves with the Yumas to make joint raids on other tribes. To 
coordinate an attack they would send the Yuma warriors a knotted string, 
keeping a similar one for themselves. Each tribe would untie one knot 
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each morning, then attack on the morning of the last knot. (R. F. Heizer 
and M. A. Whipple, The California Indians, 1931, p. 377) We should 
now describe their action by saying that they tied the same number of 
knots in each string. But their technique did not require numbers or 
counting. It depended solely on the recognition that two sets (knots, 
days, mornings) with quite different members may be similar in that they 
can be paired off (knot for knot, knot for morning, etc.) so as to exhaust 
all members of each set. That the Mohaves were pairing off sets in this 
way shows that they were near the idea of number, for the next natural 
step would have been to assign a name to each kind of string of knots and 
use the name to describe any set that could be paired with it. 

This historical example suggests that natural numbers are properties of 
sets. Indeed, we ordinarily say that two sets have the same number of 
members when we can pair them off. Counting a set merely amounts to 
pairing off the members of the set with a subset of the natural numbers. 
We shall call two sets similar if they can be paired off. Then we shall 
define cardinal numbers as classes of similar sets. We use the word “class” 
as a synonym for “set” to avoid phrases such as “a set of sets.” 

To give a precise meaning to the concept of pairing off two sets, we must 
see clearly what the process involves. When we pair off two finite sets, 
we pick out a member of one and a member of the other, and put them 
aside. We repeat the process until all members of both sets are exhausted. 
If any members of either set are left over, we conclude that the sets cannot 
be paired. In short, we attempt to form a set of ordered pairs whose first 
components are from one set, whose second components are from the 
other set, and such that every member of the first set and every member 
of the second set belongs to just one ordered pair. Such a class of pairs 
is a relation whose domain is the first set and whose range is the second. 
More than that, such a class is a function and so is its converse, since 
each first component and each second component belongs to only one pair. 
In the language of correspondences, such a pairing is called a one-to-one 
correspondence. 

(a) Review the discussion of correspondences in Section 5-5 and the remarks 
just before (5-10-5). 


We say that the set A is similar to the set B, and we write A ~ B if 
a one-to-one correspondence can be set up between A and B. Formally, 


(1) Def. A+B 
— IF F € Fun A F* € Fun A Dom (F) = A A Rge (F) = B. 
To show that two sets are similar we may exhibit a function required 


by (1). [See (2-10-5) and the paragraphs that follow it.] For example, 
to show that {a,b} ~ {c,d}, we exhibit {(a,c), (b, d)}, which is a func- 
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tion of the required type. Evidently if one one-to-one correspondence 
exists between two sets, there may be many others, but it is sufficient to 
show the existence of at least one. Where the sets have many or an in- 
finite number of members, it is not convenient to tabulate the one-to-one 
correspondence. In such cases, the function may be specified by giving a 
defining sentence. 

Show that the following are true by exhibiting a one-to-one correspondence. 
(Ὁ) {0,1, 2} ~ {1,2,3}. (c) Married men ~ married women (monogamy 
is assumed). (d) The digits ~ the even numbers from 2 to 20. (6) EF = D, 
where & = the even positive integers, and D = the odd positive integers. 


The following fundamental properties of similarity are easily proved. 


(2) A= A, 
(3) (A = B) > (δ & A), 
(4) [4 - BA BeCl CARH. 


To prove (2) we note that the identity relation, /4, defined in (5-9-7), 
is a function satisfying the definition. Theorem (3) follows from the ob- 
servation that the right member of (1) is logically equivalent to the 
result of interchanging A and B. To prove (4), suppose by hypothesis 
that there is a one-to-one correspondence F between A and B, and a one- 
to-one correspondence G between B and ὦ. Then F ° G is a one-to-one 
correspondence between A and C, and G(F(zx)) in C corresponds to x in A. 


(f) Illustrate the proof of (3) by exhibiting a one-to-one correspondence 
between {1, 3} and {8, 15} and the converse of this correspondence. (g) Sim- 
ilarly illustrate the proof of (4) by exhibiting a one-to-one correspondence be- 
tween {1,3} and {8, 15}, between {8, 15} and {0, 5}, and between {1, 3} and 
{0,5}. (h) How did we use (2—10-9) in proving (4)? 

We now define the cardinal number of a set (roughly speaking, the 
number of members in a set) as the class of all sets similar to it. 


(5) Def. " φγ(9) = {x]x “ S}, 
(6) [S(A) = W(B)| «» (A = 8). 


Definition (5) defines a function 9%, whose domain is the class of all sets 
under consideration and whose range is a set whose members are classes 
of similar sets. δὲ maps each set A onto the class of all sets similar to A. 
‘These classes of similar sets are disjoint, since if 9((A} and 9t(B) have a 
set in common, then because of (4) and (5) they each contain all sets 
similar to it and are therefore identical. Hence every set has a unique 
cardinal number associated with it. 


(i) List several members of 9(({a, b}), where a ¥ ὃ. 
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When a child is learning the meaning of numbers, he associates each 
number with various sets having the appropriate number of members. 
Thus the child associates 2 with a set of two ducks, two children, two toys, 
etc. All such sets associated with 2 are similar. They are members of the 
class of all sets similar to any one of them. We call this class of similar 
sets the cardinal number of any one of them. We define 2 to be this car- 
dinal number, That is the idea behind the following definitions. 


(7) Def. 0 = N(Q). 
(8) Def. 1 = x({0}). 
(9) Def. 2 = N({0, 1}). 


Thus we define the number 0 to be the cardinal of the null set, Le., the 
class of all sets similar to the null set. We define the number 1 to be the 
cardinal of a singleton. Any singleton could be used in place of {0}, but 
in the sequence of definitions the most obvious procedure is to use the 
symbols already defined to make each additional definition. In this way 
we use the symbols “Ὁ and “1,” defined in (7) and (8), to designate the 
pair {0, 1}, whose cardinal is 2. 


(j) How many members does 0 have? (k) How many members does 1 have? 
(1) Define the cardinal 3 similarly. (m) List several members of 1. (n) Define 
4 and list several of its members. (0) For (9) to be a satisfactory definition 
of 2, we need to prove that 0 ~ 1, since otherwise {0,1} might be a singleton. 
Suggest an idea on which such a proof might be based. 


The cardinals 1, 2, 3, ... obtained by continuing the definitions (7) 
through (9) are called natural numbers. We let N stand for the set of natural 
numbers. Each successive natural number is defined as the cardinal of a 
set whose members are just the previously defined cardinals, so that, 
roughly speaking, the sets belonging to each successive natural number 
have one more member than those belonging to the preceding natural 
number. It would be “natural” to call 0 a natural number, but actually 
zero was understood only recently and the term “natural numbers” refers 
to the sequence beginning with 1. 


(p) Assuming that 0 through 100 have been defined, define 101. 


If the cardinal of a set is Ὁ or is a member of Ν᾽, i.e., a natural number, 
we say that the set is finite. In the contrary case we say that the set is 
infinite. Cardinals of finite sets are called finite cardinals, and those of 
infinite sets are called transfinzte. | 

The set of natural numbers N has a cardinal, (NV), which is called 
aleph null, No. Any set that can be put into one-to-one correspondence 
with the natural numbers has the cardinal No and is said to be denwmer- 
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able. For example, the set of even numbers is denumerable, since the func- 
tion 27 is a function satisfying the requirements of the definition (1). 
Under it, each natural number n corresponds to the even number that is 
its double. The correspondence may be suggested by a partial tabulation: 
{(1, 2), (2, 4), (3, 6), ..., (17, 34), ....}. 


(q) Define this one-to-one correspondence explicitly and formally. (r) Show 
that the odd numbers are denumerable. 


The cardinal of the natural numbers, No, is the “smallest” transfinite 
cardinal. It seems obvious that it is not finite and fairly obvious that any 
“smaller” cardinal is finite, but the proof is not trivial. The interested 
reader is referred to Theory of Sets, by E. Kamke, and to the classic 
Contributions to the Founding of the Theory of Transfinite Numbers, by 
Georg Cantor. 

_ We see from the examples that an infinite set may be put into one-to- 
one correspondence with one of its own proper subsets. This is a charac- 
teristic of all infinite sets and is sometimes taken as a definition of them. 


PROBLEMS 


1. Exhibit as many one-to-one correspondences as you can between {1, 2} 
and {3, 4}. | 

2. Do the same for {1, 2, 3} and {0, 1, 2}. 

3. Do the same for {1, 2, 3} and {1, 2, 3}. 

%4. A one-to-one correspondence of a set with itself is called a permutation. 
How many permutations are there of a set of 3 elements? 4 elements? n ele- 
ments? 

5. What conclusion can you draw from A = @? 
6. Show that 4 - B—- A = Bis not a law. 
7. Show that the converse is a law. 
8. Show that 4 = BAC # D—- (AUC) = (BU D)is not a law. 
9. Show that with the added hypothesis A//C A B//D, it is a law. 
10. Show that the perfect squares 1, 4, 9, 16, ... are denumerable. 
11. Show that the set of finite cardinals {0, 1, 2, 3,...} belongs to No. 
12. Show that the integers are denumerable. 
13. Show that the unit fractions 1/1, 1/2, 1/3, ... are denumerable. 
14, Show that 90({1}) = 1, 9t({1, 2}) = 2, OT({1, 2, 3}) = 3. What does 
this have to do with the usual process of counting? 
*15. Show that the positive rational numbers are denumerable by making use 
of the schema in Fig. 6-1. Note that every rational appears in the array more 
than once. However, by following the indicated path and using each rational 
only when it is first named, we may set up the required one-to-one corre- 
spondence. 
* 16. Use the result of Problem 15 to show that the set of all rationals is de- 
numerable. 
*17. Show A EC No A BEN’ - (AU Β) ENo. 
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1/8 2/8" 3/8 . . ° ὸ ὃ Ρ 
17 
1/9 2/9 ° 5 ; ‘ ἃ ᾿ ‘ 
Figure 6-1 


*18. Show that the union of a finite set and a denumerable set is denumerable. 
* 19. Show that the union of a finite number of denumerable sets is denumerable. 
* 20. Show that the union of a denumerable number of denumerable sets is 
denumerable! 

*21. A set that is either finite or denumerable is called countable. Are all sets 
countable? To show that the answer is no, it is sufficient to exhibit an infinite 
set that is not denumberable. We consider the set of all real numbers between 
Ὁ and 1, and use a proof by contradiction. Suppose, then, that this set is de- 
numerable and let F(t) be the real number corresponding to the natural num- 
ber ἡ. Then we imagine the real numbers between 0 and 1 arranged as follows: 


FQ’) = 0.€11€12013...@1;..., 
F(2) = 0.da91d22093...@2;..., 
F(3) = 0.a31@32033...43;..., 
(10) F(4) = 0.a41042043...04;..., 


F(5) = 0.051052053...05i..., 


F(t) = 0.a;14,2a;3 «Δ... 


Here αι is the first digit in the decimal expansion οἱ F(1), a54 is the fourth 
digit in the expansion of F(5), and so on. Now establish the contradiction by 
exhibiting a decimal between 0 and 1 that is not among the decimals in (10)! 
* 22. Show that the set of all points in the interval (0 1) is similar to the set 
of all points on the entire infinite straight line! 


ANSWERS TO EXERCISES 


(0) {(0, 1), (1, 2), (2,3)}. (0) {(2, y)|y = the husband of +}, that is, Hu 
of Section 5-3. (4) {(z, y)|y = 2. +1) A « [8 ἃ digit}. (e) {(a,y|y=at 
1 A xisaneven number}. (f) {(1, 8), (3, 15)}, {(8, 1), (15, 3)}. (6) {{(4, 8), 
(3, 15)}, {(8, 0), (15, 5)}, {(1. 0), (8, 5)}. (ἃ) Knowing the existence of some 
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one-to-one correspondences, we were permitted to introduce “F” and “G” with 
the desired properties. () (1, 2), (0,1), (ὦ, U),.... 

(j) One. (k) S(the set of all singletons). (J) 9t({0, 1, 2}). (ἃ) Any 
singletons. (ὦ) 9U({0, 1, 2, 3}), (1, 2, 3, 4), (Hu, Wi, Mo, Fa),.... (0) See 
Exercises (j) and (k). (p) 9U({0, 1, 2,..., 99, 100}).  (q) {(a, y)|y = 25 
and x is a natural number}. (r) ἡ = 2] — 1. 


ANSWERS TO PROBLEMS 


1. There are two. 3. There are six. 5. A = (6. 7. By (2). 9. A one-to-one 
correspondence for the conclusion is just the union of those for the two sim- 
ilarities. 11. F = 1- 1. Note that “belongs to aleph null” is synonymous 
with “is denumerable.” 13. They are similar to the natural numbers by 1/f. 
15. The rational corresponding to n is found by counting along the path, omit- 
ting duplications, to the nth rational. Conversely, the natural number corre- 
sponding to a rational is found by counting to find the position of the first 
occurrence of the rational. 17. In setting up the correspondence, take elements 
alternately from A and B. 21. Form a decimal by choosing for the ith digit a 
digit different from a; and from 9! 


*6-3 Arithmetic of the cardinals. When a child first learns to add 
numbers, he does so in terms of the union of sets of objects. He has four 
objects in one hand and two different objects in the other. He puts all 
the objects in one hand (forms the union) and finds that he has six objects. 
The number of members in the union of two disjoint sets is the sum of the 
numbers of members in each. We use this idea to define formally the sum 
of any two cardinals. 


(1) Def. min 
= 29 Vr Vy ([z//y A m= Uz) An = σ109)] oz = Ue ὦ φ)], 


(2) A//B — [πί(4 U B) = SA) + 91(8)}. 


(a) Translate (1) literally into ordinary English. (Ὁ) Explain how (1) de- 
fines m+ ἢ as the cardinal of any two disjoint sets having m and n members. 
(c) How does (2) follow from (1)? 


From (1) it is not too difficult to prove the laws of addition for natural 
numbers. For example, the commutative law m+ n = ἢ - m follows 
immediately from (2) and (8-7-3). 


(d) Why? (6) Argue for (1-16—7) for natural numbers. 


Now, from (1-14-19) the rules for subtraction of natural numbers 
follow, and we can derive the following law, which has considerable use 
in probability and statistics. 
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(3) (A UB) = NA) + WB) — N(A ἡ B). 


This follows from (2) and the observation that A UB = A ὦ (A’N B) 
and that B = (A’M B) U (A ἡ B). Since the terms in the right member 
of each of these equations are disjoint, by (2) 9(A U B) = (A) + 
3(A’ ἡ B) and N(B) = 9A’ ἡ B) + 92(A NB). Subtraction of the 
second equation from the first yields (3). 

(f) Show that A UB = AU (4' ἢ B) and that 4//(A’M Β). (9) Show 
that B = (A’M B) U (ANB) and that (A’N B)//(A NB). (h) Argue for 
(3) with a Venn diagram. (i) In 1953 the membership of the American Mathe- 
matical Society was 4,473 and that of the Mathematical Association was 5,289. 
In a joint directory, 7,590 names were listed. How many persons were mem- 
bers of both organizations? 


Multipheation of finite cardinals is sometimes defined in terms of 
repeated addition, but it can be defined very neatly in terms of the idea 
of a cartesian product. 


(j) Glance again at Section 5-2. 
Suppose we have two finite sets A and Bwithm = 9(A) andn = M(B). 
The cartesian product A Χ B can be visualized as in (4). There is a row 


in this array for each element of A and a column for each element of B; 
1.e., there are m rows of n pairs each, or mn elements altogether. 


(ay, by) (ay, θ2) . .. (G1, bn), 


(a2, 61) (2, ba)... (aa, dn), 
(4) 


(Am, 61) (Qm, bo) ees (Gm, bn). 


The above discussion suggests defining the product of any two cardinals 
m and 7 as the cardinal of the cartesian product of any two sets with m 
and n members; that is, 

(5) Def. 
m:n=29Vae Vy (lm = Ux) An = Uy)] > z = Ue X y)), 


(6) (A Χ B) = IA) - NCB). 


(k) Why does (6) follow from (5)? (1) Derive the commutative law of 
multiplication for natural numbers from (5) and (6). (m) Why did we not 
require that A and B be disjoint here? 


From (5) the familiar laws of multiplication can be proved. Then 
division can be defined as in (1-14-1), and the laws of division proved. 
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Of course, subtraction and division may not always have answers if we 
limit ourselves to the natural numbers, so that the laws hold only where 
the result of an operation is defined. 

There is nothing in the definitions (1) and (5) that limits them to finite 
cardinals. For example, we can use (1) to find the sum of aleph null and 
itself, that is, No + No. According to (1), we may do so by choosing any 
two disjoint sets with aleph null members (i.e., any two disjoint denumer- 
able sets), forming their union, and determining the cardinal of the result. 
But by Problem 17 1n Section 6-2, the cardinal of the unton 1s also de- 
numerable. Hence No + No = No. Here we see an example of the fact 
that the arithmetic of transfinite numbers is not the same as that of finite 
numbers, even though the laws that follow from (1) and (5) hold for all 
cardinals. The reader who wishes to pursue this fascinating branch of 
mathematics further may consult the references given at the end of 
Section 6-2. 


PROBLEMS 


1. A has 6 members, B has 25, and the two have 3 in common. How many 
members in their union? 

2. A has 20 members, B has 100 members, and their union has 100 members. 
What is the situation? 

3. A sociologist added together the estimates of different types of patho- 
logical deviants (alcoholics, criminals, mathematicians, etc.) in the United 
States as found in various representative textbooks. The total was about 
104,000,000. Does this result discredit the figures from which it was derived? 

4. Derive the associative law of addition for cardinals. 


Verify (6) for the sets in Problems 5 through 8. 


5. A = {1, 2,3}, B = {4, 5}. 

6. A = {1,2,3}, B = {2,3}. 

7. A = {a}, B = {1,2,..., 10}. 
8. A = {a}, B = No. 


9. Derive the associative law of multiplication for cardinals. 
10. Derive (1-14-8). 
11. Which axioms of Section 1-16 do not hold for cardinals? 
12. Derive the distributive law for cardinals. 
13. Show directly from (5) that O-m = m-0 = 0. 
14. Show from the definitions of this and the previous section that 1 + 1 = 2, 
2+1= 38,and0+1 =1. 
15. Show that No: No = No. 


ANSWERS TO EXERCISES 


(a) m plus n is the z such that for every zx and y if 1 is disjoint with y (see 
3-8-18) and m is the cardinal of x and n is the cardinal of y, then z is the cardinal 
of the union of x and y. (Ὁ) The right member says that z, that is, m-+ n, 
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is Just the cardinal of the union of any x and y that are disjoint and have car- 
dinals m and n respectively. (ὁ) Let m = SU(A) and n = IB), then use 
(3-9-1). (ἃ) By (8-7-3) σα ὦ 8) = N(BU A). Hence WA) + W(B) = 
M(B) + 9(A) for any A and B. Or we may use (1) directly by noting that its 
right member is not changed by interchanging the dummies x and y. Inter- 
changing m and πὶ and the dummies, we find that the right members are identical 
except for differences that can be eliminated by use of the commutative laws 
of logical conjunction and of union. δ) Use (6-2-7) and (8-7--1 δ). 

(Ὁ Expand right member, using (8-7-7), (8-7-19), and (38-7-18). ‘To show 
disjointness, use (8-8-18). (6) Similarly. (h) When we add 9((A) and 9(B) 
we count 9t(.4 M B) twice. (i) 2172. (k) m~-n is defined as the cardinal 
of the cartesian product of any two sets whose cardinals are A and B. Or sim- 
ilarly to Exercise (c). (Π) Similar to Exercise (ἃ). (m) Because we are 
dealing with ordered pairs of members. 


ANSWERS TO PROBLEMS 
1. 28. 3. No. 5. 6 members. 7. 10 members. 9. Use (5) twice, then laws 
of logic and cartesian product. 11. The last three. 18. An informal argument: 
Since the only set with cardinal 0 is the null set, we need the cardinal of the 
cartesian product of the null set with another set. But this cartesian product 
can have no members, since each of its members must have a first element 
from the null sect. 15. See Problem 20 in Section 6-2. 


*6~4 Axioms for the natural numbers. In the last two sections we 
have sketched one way of defining the natural numbers and the opera- 
tions on them. The familiar properties of natural numbers can be derived 
from those definitions. An alternative procedure is to take the natural 
numbers as undefined, characterize them by a few axioms, define the 
operations upon them, and make use of these axioms and definitions to 
derive other laws. The way of doing so that we outline in this section is 
due to G. Peano, an Italian mathematician who lived from 1858 to 1932. 
Of course, no matter how we lay the foundations of a theory of the natural 
numbers we arrive at the same laws and properties. The axioms that we 
state in this section, known as the Peano postulates (though our axioms 
here are not identical with his), could be derived from the theory sketched 
in the previous sections. 

We now take as undefined three constants: “N” (which we consider as 
a name for the set of all natural numbers), “1” (which we consider a name 
for one), and “S” (a name for a function whose domain is NV). We call 
S(x) the successor of x, and 8 the successor function. Our axioms will 
characterize the natural numbers by utilizing the fact that they form a 
sequence in which each number has an immediate successor. 

Our first axiom merely asserts that 1 is a natural number. 


(1) Ax. LEN. : 
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We are familiar with the fact that every natural number has a unique 
successor and that this successor is also a natural number. This is our 
second axiom. 


(2) Ax. nEN > S8(n) EN. 


(a) How is the uniqueness included in the theory? (Ὁ) Graph ὃ in the 
cartesian plane for the first few values in its domain. 


We now wish to introduce the special position of 1 as the first natural 
number. We do this by specifying that no number precedes it, 1.6., there 
is no natural number whose successor is 1. This is our third axiom. 


(3) Ax. ~dxzxEN A Sx) = 1. 


(c) Show that (3) 0 Vrz2 EN > S(z) # 1. (d) H we have in mind that 
S(n) = n-+ 1, is there a number whose successor is 1? Does this contradict (3)? 


We note now that every natural number except 1 has a unique prede- 
cessor, i.e., if the successor of m is the same as that of ἢ, then m = n. 
This is our fourth axiom. 


(4) Ax. [s(m) = 8(n)] > (m = n). 


(6) Show from (4) that m = n — 8*(m) = 8*(n). (ἢ) Argue that (4) 
means that each natural number has no more than one predecessor. (g) Argue 
that (4) means simply that $* is a function. 


The preceding axioms are not sufficient to characterize the natural 
numbers. We need an axiom to indicate that N contains just the numbers 
obtainable by continuing to take successors. According to (1) and (2), NV 
contains 1 and the successor of each of its members. Now we wish to 
specify that N consists of only those numbers that can be obtained from 1 
by successive application of 8, that is, that N consists only of 1 and num- 
bers that are successors of natural numbers. A convenient way of stating 
this is to assert that if M is a set that contains 1 and that contains the suc- 
cessor of each of its members, then ΠΗ must contain all the natural num- 
bers; that is, VN C M. Formally, 


(5) Ax. 1leM AVn(nEM—- Sin) Ε Μ))-Ο Ν ς ἡ. 


In other words, N is the smallest set οἵ numbers that have the properties 
(1) and (2). Still another formulation is that we can reach any natural 
number by starting with 1 and finding successors. 


(h) Consider the set consisting of all the natural numbers and the numbers 
1.6, 2.6, 3.6,..., where S(n) = n+ 1. Show that this set satisfies all the 
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axioms except (5). (i) What is the range of 8) What are the domain and 
range of δε (7) Recalling that 1, N, and 8(n) are undefined above, let 1 be 
the usual number —1, N be the negative integers, and $(n) = n — 1. Show 
that all the axioms are satisfied! 


The example in Exercise (1) indicates that the Peano postulates apply 
to sets other than the natural numbers. Indeed, any denumerable set 
arranged in one-to-one correspondence with the natural numbers satisfies 
the axioms. For let F be any function whose domain is N and for which 
F* is a function. Then let F(1) play the role of 1 in the axioms, let F(n) 
play the role of n, and let S(F(n)) = F(S(n)). Then all axioms are sat- 
isfied by the range of F. In Exercise (j), F(n) = —n. F serves simply 
to rename the members of NV. 


Show that the following satisfy the Peano axioms, and indicate F in each 
ease: (Κ) the even integers, (4) the odd integers. 


To complete our picture of the natural numbers we must introduce 
their properties associated with addition and multiplication. We do so 
by defining “--” and “-” in terms of 8. We begin by defining m + 1 as 
S(m). Then we define m + 8(n) as S(m + n); that is, 


(6) Def. m+1= 8(m), m + S(n) = S8(m+ πὴ. 


For convenience we adopt the usual decimal notation for natural num- 
bers, introduced formally by definitions such as the following. 


(7) Def. 2= 86), 3 = 8(2), 4 = 8(3), 5 = 8(4),.... 


(m) Rewrite (1) through (5) with 8(n) replaced by n + 1. 


Then we can use Eqs. (6) and (7) to find the sum of any two natural 
numbers. For example, to find 3 + 2, we note that ὃ + 2 = 3+ 8(1) = 
(3 + 1) = 8(8(3)) = 8(4) = 5. The easiest way to derive the addition 
tables is to begin by finding n + 2 for values of πὶ as we just did forn = 3. 
Having found these we calculate n + 3 for values of n. For example, 
knowing that 3+ 2 = 5, we have 3+3 = 34+ 8(2) = 88+ 2) = 
$(5) = 6. Next we find the values of n + 4, and so on. In this way we 
can build up an addition table from (6) alone. 


(n) Show that 2 € N. 
The following definition of multiplication is similar to (6). 
(8) Def. m:-l=m, m:-S8(n) = m-n+m. 


(0) What is the scope of the multiplication dot in the right member of the 
right equation in (8)? 
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We should like to prove 
(9) 1-m = ™m. 


Now certainly (9) is true for (m:1), since this is Just 1-1 = 1, which 
is just (m+ 1 = m)(m:1). To prove that it is true for m any natural num- 
ber, we are going to adopt what seems like a rather artificial device. Let 
M be the set of values of m for which (9) is true; that 1s, 


M = {m|1-m = m}. 


Now we shall prove that. N C M, that is, that all natural numbers belong 
to M. This is the same as saying that (9) is true for m any natural number. 
But N C M is the conclusion of axiom (5). Hence our strategy will be 
to prove the hypotheses of (5). Evidently, 1 Ε M, since we have shown 
above that (9) holds for (m:1). To prove Vnn € M — 8(n) Ε M, we 
assume n € M by the Rule of Hypothesis. That is, we assume 1-n = ἢ, 
and hope to derive 1-8(n) = 8(n). Now, by definition (8), 1-S(n) = 
1-n+1. By hypothesis, 1-n = n. Hence 1 - δ(η) =n+1 = 8(n), 
which is what we wished to prove. Hence the implication holds by Q.E.D. 
Since both parts of the hypothesis of (5) hold, the conclusion follows by 
the Rule of Detachment. 


(p) Rewrite the proof in your own words. Review Section 2-6. 


With the aid of the above results, it is now easy to derive the multiplica- 
tion tables step by step along the lines described for addition. Then, by 
methods like those in the proof of (9), we can prove all the laws of arith- 
metic for natural numbers. Our discussion here has been very brief and 
incomplete. For one thing, we have failed to show that the definitions 
(6) and (8) are really adequate to define addition and multiplication of 
natural numbers. The matter is not as simple as it seems. For further 
details, the reader should consult Foundations of Analysis, by Edmund 
Landau. 


PROBLEMS 


1. Argue that (5) implies the following: 
(5)) (MCNAILEMA VYalnEM — δ(η) Ε Μ|))) ~-M = Ν. 


2. Read each of (1) through (5) in words. 

3. Show that N is the intersection of all sets to which 1 belongs and which 
have the property that if n belongs, then 8(n) belongs. 

4. Derive the addition table for 2 - n with n = 1, 2,3,...,9. 

δ. Derive the multiplication table for 2 - n. 

6. Show that m-2 = m+ ™m. 

7. We may define < for natural numbers as follows: 
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(10) Def. m<n=i4rxeENAmt+2=n.. 
Show that 
(11) m<m-+ 1. 


8. Argue that the natural numbers as defined in Section 6-2 satisfy the 
axioms and that S(n) = N({tle =O V (@ENAx<n}) = [2|3ν 32:5} 
nN2ZE1LA y//eAKue=yU2} = {alVyy exe ἃ — {y}) En}. 

9. Show that the natural numbers greater than 100 satisfy the Peano 
postulates. | 

10. Show the same for the natural numbers and zero. 


ANSWERS TO EXERCISES 

(a) Since ὃ is a function. (b) Discrete points lying on y = x+ 1. 
(ec) (2-10-38), (2-5-24). (d) Yes, 0, but 0 is not a natural number. Here we 
are speaking only of natural numbers. (6) (4) = [m # ἢ - 8(m) # 8(n)] by 
(25-28). In this (m:8*(m), nS*(n)) yields 8*(m) ¥ 8*(n) — 8(8*(m)) γέ 
§(8*(n)). But by (5-10-9) the conclusion is the same as m τέ ἢ. Hence $*(m) γέ 
$*(n) — γι τέ n, which is the same as what we wish to prove by (2-5-28). 
(f) 1 has none. 

(g) (6-4-1). (8) Here N in the axioms is the set of all natural numbers 
together with those of the form n + 6/10, where n is a natural number. How- 
ever, now N is not the smallest set satisfying (1) and (2. G) N — {1}; 
N — {1},N. (Ὁ Checkeachone. (k) F = 27. (1) 22 —1. (n) (2), (7). 


ANSWERS TO PROBLEMS 


1. (3-8-5) ὃ. Since it is included in every set with these properties. 
7, (2-10-5). 9. 101 plays the role of 1. 10. 0 plays the role of 1.. 


6-5 Finite induction. Let us imagine a ladder extending into the sky 
without end. Suppose, first, that we are able to reach the bottom rung, 
and, second, that we are able to move from any rung to the next one. It 
seems reasonable to conclude that we could reach any rung. This is an 
intuitive image illustrating the idea of finite induction. 

Suppose we have a sentence s(x) in which the range of the variable in- 
cludes the natural numbers. Suppose we prove that s(1) 15 true (we can 
reach the bottom rung) and that for any 2, zf s(x) 15 true, then s(x + 1) 
is true (we can get from any rung to the next). Then it seems reasonable 
to conclude that s(x) is true for all natural numbers (we can get to any 
rung). 

(a) Imagine a long line of toy soldiers so arranged in a line that if any one 
falls it will knock over the next one. What happens if the first one falls? (b) In 
the ladder image, what is s(x)? (ὁ) What is s(x) in the toy-soldier image? 
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These ideas can be stated in a theorem as follows. 
(1) [s(1) A Wx [s(v) > βίῳ + 1)]] - νη [π E N > s(n)]. 


The letter n is used on the right to suggest that its values are natural 
numbers. 


(4) Could we use n as a dummy on the left in (1)? (0) Let Δ] Ξ {a| s(x) }, 
and state (1) in terms of set membership and set inclusion. 


To prove (1) we make use of the ideas of Exercise (6) and of the proof 
of (6-4-9). (Readers who have omitted Section 6-4 may omit this and the 
next paragraph.) Let M be the set of values of x for which s(x) 15 
true; that is, 44 = {x|s(x)}. Since s(1) by hypothesis, 1 € M. Since 
Vax s(x) — s(x + 1) by hypothesis, γα x Ε M — (x + 1) € M. Hence by 
(6-4-5), N CM; that is, all natural numbers belong to M. In other 
words, s(x) is true for all natural numbers! 

We illustrate the way in which we can use (1) by proving 


(2) 1t+m=m-+il 


from the axioms and results of the previous section. Here 8(5) = 
fltm=m+]1]), 8(1) Ξξ [1 -Ἐ1 Ξῷ 1-Ὁὄ 1], s@+1=[1+@4+1)= 
(zc + 1) + 1]. We have, therefore, to show two things: first that 1 + 1 = 
1-+-1, and, second, that Vz[l1ta2=2+1-1l+@+)D=@4+1))+1). 
The first is true by (2-5-1). To prove the second we use the Rule of Hy- 
pothesis by assuming 1+2=2+1. Toderivel+(@+1)= («+ 1)+1 
from this assumption, we note that from (6—-4-6)(m:1, n:z) we have 
1+ (ὦ - 1) ΞξΞ 8(1+ 2). But by hypothesis, 1 -+2—=2-+1. Hence 
1+ (@+1) = 8( - 1). And since S(x + 1) = (ὦ + 1) + 1, we have 
14+ («4+ 1) = («+ 1) + 1, as desired. Now we have shown both hy- 
potheses of (1) and so can conclude that Vn n € N — s(n); that is, that (2) 
holds for all natura] numbers m. 

It is important to notice the pattern of the above proof. First we show 
that the sentence holds for 1. Next we show that if it holds for a natural 
number, then it holds for the next one. Then from (1) we conclude that 
the sentence holds for all natural numbers. 

The idea of induction can be generalized. Suppose that we are able to 
reach the tenth rung of a ladder and able to get from the tenth, or any 
later rung, to the next one. Then it seems reasonable to suppose that we 
can get to any rung past the tenth. The idea could be stated formally 
and proved, but we prefer to embody it in a new rule of proof. 
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ΑΓΕ oF Inpuction: To prove that a sentence involving a 

variable is true for all natural numbers greater than or equal to a 

(3) given one, it 1s sufficient to prove that the sentence holds for the 

given natural number and that if it holds for a natural number 

not smaller than this, then it holds for the succeeding natural 
number. 


Schematically, 
s(n) 
(4) s(x) > s(x + 1) forz > ny 


εἷς s(n) forn > ny 


(ἢ Write out a law embodying (3) and (4). Include the quantifiers omitted 
in (4). 


When the Rule of Induction is used, the proof of the sentence for a 
particular case [7.1 in (4)] is called the verificatton. The proof that s(x) — 
s(t + 1) is called the induction. To carry out the induction, it is usual 
to use the Rule of Hypothesis to assume s(x) and derive s(x + 1) from 
it. In that case, s(x) is called:the induction hypothesis. 


(g) Review Section 2-6, especially the Rule of Hypothesis. Does assuming 
s(x) in the induction amount to assuming what we are trying to prove? Can 
we use substitution for x in s(x) when it is so assumed? 


Students often have difficulty understanding induction, but under- 
standing comes with familiarity. The procedure is basically simple. Τὸ 
construct an induction proof, the following steps are recommended. 


I. Before beginning to write the proof, write the theorem in 
the formn Ε N — s(n), and write out s(n,), s(x), and s(a + 1). 
Usually n, 181. 
(5) II. Begin by proving s(n1). This ts usually easy. 
III. Complete the proof by proving that Vx s(x) — s(x + 1). 
This 1s usually most easily done by assuming s(x) and deriving 
s(x + 1) from τί. 


We follow this pattern to prove that for any natural number n, 
sin (6 + 2n7r) = sin 6. The theorem isn € N — [sin (θ + 2n7) = sin 6]. 
Here s(1) = [sin 6 = sin (@ + 27)], s(x) = [sin (@ + 2x7) = sin 6], and 
sin (« + 1) = [s(@ + 2(4 + 1)π) = sin 6]. Now for the proof: s(1) is 
true by (4-12-19). Our induction hypothesis is s(x), or sin (6 + 2x7) = 
sin θ. We begin with the left member of s(x + 1) and try to use the induc- 
tion hypothesis and known laws to show that it is equal to sin θ. We have 
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(6) sin (6 + 2(2 + 1)πῈ = sin (6 + 2am ++ 27) 


(7) sin (6 + 2a) (4-12-19) 
(8) = gin 6 (Induction hypothesis). 


| 


But (8) is just s(x + 1). So the proof is complete. 
(h) Prove by induction that cos (@ + 2nmr) = cos 6 for all natural numbers n. 


Both parts (the verification and the induction) of an induction proof 
are necessary. It may happen that s(1) [or even s(n) for many values of 
ἢ] is true, and yet that s(n) is not true for all natural numbers. For 
example, if s(n) = [n? — 1,000,000 < 0], then s(n) is true for the first 
999 values of n, but not for all natural numbers. Here we could perform 
the verification but not the induction. On the other hand, if s(n) = 
[In = n+ 1], then s(n) is always false, yet it is easy to show that s(n) 
- s(n+ 1). Here the verification is impossible and the induction easy. 


(i) Show that [n = n+ 1] ~[n4+1 = (n+ 1) - 1]. (7) Construct ex- 
amples in which s(1) is true but n Ε N — s(n) is not a law. (k) Construct 
examples in which s(n) — s(n-+ 1) is true, but n € N — s(n) is not a law. 


It should be clear from the above discussion that (1) is a theorem of 
mathematics and of deductive logic. It is called the principle of finite 
induction to indicate that it deals with only finzte numbers. Although the 
word “induction” is involved in its name, it must not be confused with the 
process of induction by which we guess general results from particular 
cases. Induction in that sense is a process of heuristic, a method of dis- 
covery. Induction in mathematics refers to the use of the rigorous rule of 
proof (3). 

By using induction we can prove the commutative, associative, dis- 
tributive, and other laws for natural numbers. The process 1s too long to 
give here, and it can be found in the reference given at the end of Section 
6-4. When these proofs have been carried out, (1-16—1) through (1-16-8) 
are theorems about the natural numbers instead of axioms. 

To illustrate once more the use of induction, we prove that the natural 
numbers are closed under addition and multiplication; that is, 


(9) (mEN AnEN) OPO m+neENn, 
(10) (mEN AnEM) > m-neNn. 


To prove (9), we follow (5). Step I: Let s(x) = [m+2¢EN]. We 
could write (9) as mE N > [n Ε Ν᾽ — s(n)], then assume m Ε N by 
Hyp and derive n € N — s(n). Hence it is sufficient to consider m as 
though it were a constant, and to carry through an induction proof on n, 
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as follows: s1) = [m+ 1 EN]; βίυ -Ῥ 1) = [m+ @4+1) EN]. Step 
II: s(1) is true by (6-4-2). Step JII: The induction hypothesis is 
(m-+ 2) € N. From this, (6-4—2)(n:m + x) and (6-4-6), it follows by 
Inf that (m+2)+1¢EN. But by (6-4-6) this is the same as 
m-+ (x +1) EN, which is just s(x - 1). Hence 5(2) — s(x + 1) by 
Q.E.D., and the proof is complete by (38). 


(1) Prove (10) similarly. (m) Why does (9) = [mE N -Ὁ (nEN -- m+ 
πῃ ΕΞ Νὴ} 
PROBLEMS 


We shall use induction many times later in the book. The following problems 
will help the reader become familiar with the method, but are not supposed to 
be sufficient to master it. In every case, follow the outline (5). Note that Step I 
is a matter of “tooling up” for the proof. The proof consists of Steps I and III. 


1. tan (θ - nr) = tan θ. 
2. In a polygon of n sides there are n — 1 diagonals issuing from each vertex. 

*3. Show that every nonempty subset of the natural numbers has a minimum. 
(Hint: Assume the contrary, that is, that 47 C N and M has no minimum. 
Then prove that M’ = N, that is, M = @. It follows that M has no mini- 
mum — M = (Ὁ, or M # @— M has a least member. This property is 
described by saying that the natural numbers are well ordered.) 

4,.nEN—->nt1#F 2, 

*5. From the Peano axioms and other results obtained from them, prove 
the commutative, associative, and distributive laws for natural numbers. 

*6. The sum of the interior angles of a polygon of n sides is (n — 2)π. (Note 
here that ny = 3. To perform the induction, imagine a polygon of 2 sides and 
see what happens when one more side is added.) 

*7. Prove that a polygon of n sides has (n* — 3n)/2 diagonals. 


ANSWERS TO EXERCISES 


(a) They all fall. (0) s(x) = we can reach the zth rung. (c) s(x) = the 
ath soldier falls. (d) Yes. (e) It is just (6-4-5). (ἢ [s(m1) A Vaux => 
ny — (s(x) -- s(x + 1))] ~ Wann EN — s(n). (g) No, it is merely a device 
for proving an implication. No, since we are not assuming that s(x) is true for 
all « (which would be assuming what we are trying to prove!), but merely 
introducing it to prove an implication. (h) The same proof with “cos” for 
“sin.” (i) n=n+1 -ῷ S(n) = 8(n-+ 1), but δ(5) =2+1. Or we may think 
of assuming n = n-+ 1 and adding 1 to both members. (4) s(x) = mrE N, 
s(1l) = m1 EN, s(x-+ 1) = ριί - 1) EN. s(1) is synonymous with 1 € N. 
By (6-4-8), m(a-+1) = mz-+m. The induction hypothesis is mz Ε N. 
Hence by (9), mz + m EN, and we have derived s(a2+ 1. (m) (2-5-30). 


ANSWERS TO PROBLEMS 


1. Like (6) through (8). 3. Begin by showing that 0 belongs to M’. Then 
show that if z belongs to AZ’, so does x -+- 1. | 
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6-6 Sequences. A function whose domain is the natural numbers is 
called a sequence. It is customary to use letters near the middle of the al- 
phabet, such as n, 7, and j, as independent variables for such functions and 
to use those near the end of the alphabet as dependent variables. Thus, 
if f is defined by y = f(x) A x EN, we usually write y = f(n). Also, 
the image of n under f is usually written y,, so that y, = f(r). In this 
section the range of n is always N, so that we need not indicate this 
explicitly in each case. 

The identity function on N, Jy, maps each natural number into itself. 
Here y, = n. A linear function on the natural numbers, m/y + ὃ, 
maps n into mn + ὃ, so that y, = mn + ὃ, yy = m-+ ὃ, yo = 2m+ ὃ, 
y3 = 3m+b, and y= yi1+(n— 1)m We have ynii — yn = 
mn + 1) +b — mn — ὃ = m, so that the difference between two suc- 
cessive images is always the same. We call a sequence defined by a linear 
function an arithmetic progression. Part of the graph of an arithmetic 
progression is sketched in Fig. 6-2. It consists of equally spaced discrete 
points on a straight line. In this context, the slope is called the common 


difference. 
y 


Φ 
Φ (2, 1 = m) 
(1, YD) 


FIGURE 6-2 


(a) An employee begins at a salary of $5000 a year and receives an increase 
of $250 a year. Define the sequence relating the years to his salaries and sketch 
it. What is his salary after five years? 


It is natural to think of a sequence as an infinite list of images: y1, Yo, 
ey Yn—1) Yny Yn+i1, ---. We call these values of the dependent variable 
the terms of the sequence. We call y; the first term and y, the nth term. 
We call yn and yn+1 Successive terms. — , 

We have seen that a sequence may be defined by giving a defining for- 
mula for the function. One may also be defined by specifying the initial 
term and giving a rule for finding the successor of any term. Thus we 
could define an arithmetic progression by writing 


μι =O A Yasui = Yn tm. 


(b) Show by induction that if y,z41 = yrx-+ m, then yn = yi + (n — 1)m. 
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When we define a sequence by giving the initial term and a formula 
for finding the successor of any term, the sequence is said to be defined 
recurswely. The equation giving y,+41 in terms of y, is called a recursion 
relation. Just above we defined an arithmetic progression recursively with 
the recursion relation ¥nz41 = Yn + m. 


(c) Where previously have we used a recursive definition? 


In Section 1-13 we defined a? and a?, but we did not give a general 
definition of a” for any natural number n. This is most conveniently done 
recursively, as follows. 


(1) Def. at=a, αὖ! = a-a". 


(4) Use definition (1) to find a?, a3, αὖ, and α. 


Definition (1) defines a” for any value of a for which multiplication is 
defined. We can use it very conveniently to prove the following laws 
when the exponents are natural numbers. 


(2) αἿ. αὖ = qt, 

(3) (a”)" = am, 

(4) (ab)” = α΄, 

(5) (a/b)” = αὐ 7", 

(6) αὐ /a" = a’ ™ (when n > m), 
(7) aja =] lia"? (when n < m). 


The simplest procedure is to prove (2) through (5) by induction and 
then derive the other two from them. We illustrate by proving (2) and (6). 
We prove (2) by induction on n. (Compare the beginning of the proof of 
(6-5-9)). Here s(x) = [a7a” = αὐ ὙΠ, s(1) = [ata” = αἱ Τῆ], ε(α +1) = 
[αὐ tlgm — gttit™) By (1), s(1) is true. For the induction we assume 
ata” = απο Now a? t!a™ = aa®*™. Using (1) again, we find a**1a”" = 
at™tmt1 which is seen to be synonymous with s(x + 1). Having proved 
s(1) and derived δία + 1) from s(x), we may assert (2) by the Rule of 
Induction. 


(e) Similarly prove (3) by induction on n. | 

To prove (6) we use the definition (1-14-1) to get the equivalent equa- 
tion αὖ = a™a"—™. But by (2) this is equivalent to a” = a”. 

(Ὁ Why n > min (6) (g) Prove (7). | 

It would be natural to extend the meaning of a” to the case of n = 0. 
We are free to define a° as we wish, but we should like to do so in such 
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a way that laws (3) through (8) still hold. For example, we wish a°a” = 
a°*™, But this is the same as a°a” = a”, which implies αὖ = 1. Ac- 
cordingly, we define. 


(8) Def. a® = 1 (a ¥ 0). 


The limitation a ¥ 0 is dictated by the fact that Οἱ = 0 and 0” = 0 by 
(1), so that we might expect both 0° = 0 and 0° = 1. Accordingly, we 
take “0°” as undefined. 


(h) Characterize the sequence defined by (1) when a = 0. (i) Show now 
that (2) holds when m or n is zero. 


The function penne? by y = cr’ is called an exponential function. 
We designate it by er’. So far cr® is defined only for natural numbers (and 
zero), so that we are dealing with an exponential function on the natural 
numbers. The sequence cr‘—! is called a geometric progression. We 
have y; = δ, Yo = cr, 9,8 = cr?, Yn = cr™—*, and Yn4i1/Yn = τ. The 
number r is called the common ratio of the geometric progression. We also 
call r” the nth power of r, and call r the base. 


(j) Sketch the geometrical progression 2- (1.5)'". (Κ) Show by induction 
that if yn41 = ὕψη; then yx = yir*—t. 


We define n!, called n-factortal, by the recursive definition. 
(9) Def. Ol = 1, (n+ 1)! = (ἢ - 1) - π!. 


(1) From (9) find 1!, 2!, 3!, 41, 5!, 10:. (m) Argue that n! = the product 
of the first πὶ natural numbers. (n) Argue that the number of ways four 
objects can be arranged in a row is 4! (Hint: Any one of the four can be placed 
first, then....) (0) Argue that n! = the number of ways n objects can be 
arranged in a row. 


PROBLEMS 
In Problems 1 through 8 give the first few terms of the sequences and sketch 


1. yn = 8- n?. 2.Yn = 2—n. 

3. yn = 10(0.5)". 4. Yn = 1. 

5. yt = 0, Yntt = yn — 1/10. 6. γι = —1, Yn4ti = 2yn. 
7. —Iy. 8. 2(—1)/N. 


9. The first term of an arithmetic progression is 5, and the common differ- 
ence is 2. Find the 500th term. 
10. Suppose that money doubles every ten years if left in the bank to draw 
interest. If one dollar is left for 1000 years, how much will be on deposit? 
11. In how many orders can three people be seated around a table? (We are 
interested here only in their relative positions!) four people? n people? 
12. Prove (4) by induction. 
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13. Derive (5). 

14. Prove that (3) and (4) hold when m or n is zero. 

15. Prove that (5) through (7) hold when m or n is zero. 

16. Can a sequence ever be both an arithmetic and a geometric progression? 

17. The first term of a geometric progression is 3 and the common ratio is —1. 
Find the 25th term. 
* 18. Prove by induction that a2" > 0. 
%19. Prove by induction that the number of ways of arranging n objects along 
a straight line is n! 
%20. Let a sum of money P be deposited at simple interest rate 7, so that the 
interest per period is 7P. Let y, be the total of original deposit and accumulated 
interest at the end of m periods. Show that the mapping n into y, is an arith- 
metic progression. 
%21. Let a sum P be deposited at compound interest 7, so that the interest 
for the nth period is ty,, where yn is the amount on deposit during the nth 
period, and yn41 = Yn + tyn. Show that the function that maps n into ψη is 
a geometric progression, and find r. 
*% 22. In his An Essay on the Principles of Population (1798), Malthus wrote, 
“I think I may fairly make two postulates. First, That food is necessary to 
the existence of man. Secondly, That the passion between the sexes is neces- 
sary, and will remain nearly in its present state.... Assuming, then, my postu- 
lata as granted, I say, that the power of population is indefinitely greater than 
the power in the earth to produce subsistence for man. Population, when 
unchecked, increases in a geometrical ratio. Subsistence only increases in an 
arithmetical ratio. A slight acquaintance with numbers will show the im- 
mensity of the first power in comparison with the second.” Tlustrate the last 
remark by sketching on the same graph the arithmetic progression with y,; = 1, 
m = 2 and the geometric progression with y, = 1, r = 2. Similarly sketch 
yi = 10,m = 3;andy, = 1,7 = 1.5. 
*% 23. Suppose that for each 1000 living people, an average of 30 more are 
born than die in each year. Show that the populations in successive years are 
the terms of geometric progression. 
*% 24. Show that population increases in geometric progression if and only if the 
yearly increase in population is a fixed percentage of the population in that year. 
%25. How do you explain the fact that population has not increased in geo- 
metric progression in the last century? 
* 26. Under what conditions does subsistence increase only in arithmetic pro- 
gression? Do you think these conditions are fulfilled? 
% 27. Find the first few terms of the sequence defined by yo = 0, yi = 1, 
Yn+1 = Yn+ yn—-1. (This sequence is due to Fibonacci, an Italian mathemati- 
cian of the twelfth century; for some of its applications, see Mathematics for 
Science and Engineering, by P. L. Alger, McGraw-Hill, 1957.) 
% 28. The following recurrence relation is adapted from Business Cycles in the 
United States of America 1919-1982, by J. Tinbergen: y, = ayn—1 — bYn—2. 
For each of the following cases, calculate enough terms of the sequence to 
guess its character:a = 1.6,6 = 1,y1 = 0, y2 = 10;a@ = 2.5,6 = 1, yi = 0, 
y2 = 10;a = 1.8, ὃ = 0.1, γι = 0, yo = 10. 
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ANSWERS TO EXERCISES 


(a) ψι = 5000, yn = 250(n — 1) + 5000. Sequence is 250Iy -+ 4750. ye = 
6250. (0) s(z) = (ye = yi + ὦ -- 1)m). 81) = (ψι = m-0+ y1 = 91). 
By hypothesis, yz41 = yz-+ m. From the induction hypothesis, yz41 = yi + 
(x — 1)m+m = y+ («+ 1—1)m, which completes the proof, since 
sa@+1) = GY =yit+(@titl-~i1)m._ (ce) To define addition and multi- 
plication in Section 6-4. (ἃ) a® = aaaaa. (6) s(x) = [(a”)*? = a™]. The 
induction goes like this: (a”)?t! = a™(a™)* = a™a™™ = a™tm*_ These steps 
are justified respectively by (1), the induction assumption, (2), and the dis- 
tributive law. But s(z-+ 1) = [(a™)?t! = a™(?t))]. 

(Ὁ If mn < μι, n — mis not a natural number, and so the expression would 
as yet be undefined. (6) It is equivalent to αἶα" = a”. (8) All zero. 
(i) We have done so for n = 0, but a"a® = a®a" by the commutative law, so 
it follows immediately for m = 0. (j) Discrete points lying on the curve 
defined by y = 2(3/2)*. (k) Use an inductive argument. (1) 1, 2, 6, 24, 
120, 3,628,800. (m) Use induction. (n) We can place any one of the objects 
in first place (in four ways), then any of the others in the next (in three ways), 
then either of the remaining in the next place (two ways), and the last number 
in the last place (one way). Hence the total number of ways is 4°3-2-1. 
(o) Induction. ° 


ANSWERS TO PROBLEMS 


1. 4, 7, 12, 19, 28,.... 3. 5, 2.5, 1.25,.... 5.0, —1/10, —2/10, —3/10,.... 
7. --Ἰ, —2) —3,=—4,...4 9. ψδο0ὺ & ὅ - 409-2. 11. 2; 6; fn — 1)! 17. 8. 


6-7 The summation notation. Let f be a sequence whose range con- 
sists of numbers. We are often interested in the sum of several successive 
terms. For example, let y, = 2n — 1, where n Ε N, so that we are deal- 
ing with the sequence whose terms are the odd numbers. We note that 


(1) y=  1-Ξἢϑ 1, 

(2) | yity2=14+3= 4 = 25, 

(3) yi ty2tys = 14+34+5=9= 83, 

(4) Hitytystys = 14+34+54+7= 16= 4°. 


It begins to look as though the sum of the first n odd natural numbers is 
just n?. But if we wish to prove this, or even to talk about it precisely, we 
need a notation to indicate the sum of n terms. 

Let us indicate the sum of the first n terms of a sequence by s,. Then 
the values of s, are terms of a sequence. Furthermore, 


(5) 81 = Y1, 8..1 = Sn + 5.41. 
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It appears that s, can be found from (5) for any value of n. Indeed, (5) 
amounts to a recursive definition of s,. (For a proof, see Landau’s Founda- 
tions of Analysis.) 

The notation s, has the disadvantage of not indicating explicitly the 
sequence that is being summed. The following notation, called the sum- 
mation notation. 


(6) Sy or Σ᾽ ΧΟ, 
lI 11 


stands for s,, and includes an expression for the terms to be added. The 
summation sign indicates that we add together the results of substituting 
for ὁ, successively, names of each integer from 1 to n inclusive. Now we 
can write our conjecture as follows. 


(7) δ οἱ — ἢ = πἢ 
=) 


(a) Argue that 7 is a dummy in (6) and (7). (0) Write, using the summation 
notation, and evaluate numerically, 85 where y, = 2n — 1, and (0) 84 where 

= (1/2)". 

We get a formal definition of the summation notation by simply re- 
placing s, in (5) with either expression in (6). 


(8) Def. Σ Κὼ = Κῶ), 
(8) Def. δ} fli) = 00 | + fin + 2). 
i=1 i=1 


(d) Rewrite (8) and (8’), using the other expression in (6). 

In (6) we call f(z) or y; the summand, the dummy 7 the vartable of sum- 
mation, and 1 and n the lower and upper limits of summation. 'The notation 
indicates the sum of the values of the summand for all integral values of 
the variable of summation between the lower and upper limits of summa- 
tion, inclusive. For example, 


t==4 
(9) >> (—1)* = (—1)* + (—1)? + (-1)? + (- 4, 
i=1 


(10) > t 
i=1 


Write out the following summations, as we have (9) and (10), with summa- 
tion on the left and expanded form on the right: (6) f(z) = ἐ2, n = 4; 
(Ὁ y = ar'—1,n = 8; (g) y= 8, π = 5. 


1 - 2 - ὃ - 4. 
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Now we can prove (7) by induction. Before starting the proof, we note 
that 


(11) s(x) = bp (Οἱ --- 1) τῷ ὁ 
i=1 

(12) 5(1) = Σ (δὲ --- 1) Ξε ἯΙ 
i=1 


i=r+1 


(13) s(t + 1) = : pH (2ὲ -- 1) = ὦ Ἔ οὐ} 


Now s(1) is immediate from (8). Also from (8), we have 


i=1 t=1 


(14) > (Οἱ --- 1) -- Σ (ἱ — υ ae ἘΣ: 


Using the induction assumption s(7), we have 


t=zr+1 
(15) > Q@-Y)=a+2%@+1= (4+). 


i=1 
But (15) is just s(# + 1), and the proof is complete. 


(h) Similarly prove that the sum of the first n even positive natural numbers 


is n(n -+ 1). 


We may wish to indicate the sum of successive terms of a sequence be- 
ginning with some term other than the first. We allow for this by the 
following definition. 


joan—m+1 


(16) Def. LO = FP fm —1 +9) 


Note that in the right member f(m — 1 - 1) = f(m), f(m — 1+ 2) = 
f(m + 1), and so on, until ἔζη — 1+ — m-+1) = f(n). Hence the 
summation is just the sum of all values of f(z) for 7 between m and n in- 
clusive. Note that there aren — m-+ 1 termsin the sum. This definition 
also has meaning for m and/or n zero or negative integers, provided that 
m « nand that f(z) is defined form < 7 < ἢ. 


Expand the following summations: (i) Σ τ (δὴ; G) Diet ys 
(dees ye ee 


Sometimes, to avoid the summation notation, a summation is expressed 
by writing the first few terms, the last term, and an expression for the ith 


346 NUMBERS [cHAP. 6 


term, with dots to indicate the missing terms. The notation is defined as 
follows. 


(17) Def f(t) +f) te $I) +2 +4) = FF. 
ial 
(m) Define f(m) + film + 1) + ---+ ff) +°+-+f(n). 


The following are fairly obvious, but can be proved from (8) and (8’). 


(8) Σ᾽ φῶ -- 6 Df. 
(19) Sic=(n— m+ De. 


| 


20) + σῶ +9) = Σ' Κὸ + ¥ of. 


=m =m t=m 


(21) si) = ΣΧ + 2 Ιὸ (m<p<n 
=m =m i=ptl 


(n) Illustrate (18) through (21) for f@) = γὼ g@) = 2, m = 1, ἢ = ὃ, 
p = 2. (ο) Of what law is (18) a generalization? (p) Of what law is (21) a 
generalization? 


PROBLEMS 


1. Use the summation notation to rewrite the left member of (22) and then 
prove it by induction. 


(22) 1T+2Q2+---+i+e:+tn = n(n + 1)/2. 


2. State (22) in words. 
3. Write (23) with the summation notation, and then prove it. 


(23) 1+4+..-+7?4.---4+ nn? = n(n+ 1)(2n+ 1)/6. 


4. Prove that if yn = 1/n(n-+ 1), then s, = n/(n + 1). 
δ. Prove (18) by induction. 
6. Prove (19). 
*7. Prove (20) and (21) 
8. Use (18) through (21) to prove that the sum of the first n terms of an 
arithmetic progression is half the number of terms times the sum of the first 
and nth terms; that is, 


=n 


(24) SS [νι + G — Im] 


i=] 


| 


(n/2)[2y1 -ἰ (n — 1)m] 
(n/2) (y1 + Yn). 


Ι 
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9. Prove by induction that the sum of the first n terms of geometric progres- 
sion is given by 


(25) Sr = γα — τῦλα — ἡ. 
i=1 


10. Find the sum of the first 1000 positive integers. 

11. Find the sum of the first ten terms of the sequence 1, 2, 4, 8,.... 

12. Find the sum of the first 100 terms of the sequence (1/2)7N—}. 
#13. Prove [Σ 11 7? = Sor &. 
*14. The summation notation is used to indicate the application of addition 
to successive terms of a sequence. Similar notations are used to indicate other 
operations. For example, 


In 


(26) ΗΠ: Yi 


=m 


means the product of all values of y; form < i < n. What is I= i? 

*15. Define (26) along the lines of the definition of the summation notation, 
or in some other way of your choice. 

%* 16. Give a formal definition of y1 + y2° y3° "Yn. 

*17. State and justify laws for IT analogous is (18) through (21). 

*18. Prove that ΠΙΞῚ i = nl. 


ANSWERS TO EXERCISES 


(a) See Section 2-11. (Ὁ) Vit? (δὲ -- 1 = 143454749 = 

(ὁ) DoE? (1/2)? = 1/2+ 1/44 1/8+ 1/16 + 1/32 = 31/32. 

(4) ΣΙΙΞΙ ee γ i γιὲ = aad + a: 

(6) Wis? =14+4+9+ 16. (f) = ar'—-1 = a+ ars ar?. 
(g) DEP δὲ = 34+64+9+ 12+ 15. 

(h) Here s(x) = [8. = σία -ἰ- 1)]. For ὦ = 1, we have 2 = 1(1+ 1). 
Set41 = 8, + 2(¢-+ 1). Using the induction assumption, s, = σία - 1), we 
find 8,.1 = a(x +1)4 2(5. - 1) = @4+ 1)(4 -Έ 1 - ΞἫ 1), which is just s(z+ 1). 
(i) 10- 15+ 20+ 25. (j) yx3tysatystyet yz (k) yrotyie+ ψ14- 
yie+ yis+ yeo. (Ὁ π2- (n+ 1)? + (n+ 2)? + (n+ 8)2. 

(m) i=" 7. (ὦ) cyr + cya + cya = c(yi + y2 t+ ys), e+e+e = 8ς, 
(ψι + 11) + (y2 + 22) + (ys + 23) = (ψι + y2 + ys) + (ὅτ + 2X2 + 23), 
yi t+ yotys = (yi - ye) + yz. (o) Distributive law.  (p) Associative 
law. 


ANSWERS TO PROBLEMS 
1. Si=ti. 3. Vitti. 9. Assuming s, = yi(1 — r*)/(1 — 7) for the induc- 


tion, we have 8241 = 8e-+ yz41 = yi (1 — r*)/08 — τ) + yir? = yn (1 — r*t1)/ 
(1 —r). 11. 210 — 1. 13. Use (22). 
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*6-8 Extension of the number system. The natural numbers have an 
origin lost in the very first cultural steps of mankind. Other kinds of num- 
bers are relatively much more recent. The extensions of the number sys- 
tem have been dictated by inadequacies of existing numbers for dealing 
with practical problems and by the urge for mathematical completeness. 
Both motives may be summarized by saying that it is desirable for prac- 
tical and aesthetic reasons to have a number system that is closed under 
the operations we wish to perform and also closed under their inverses. 
In this section we sketch in outline the history and logical structure of the 
system of rational numbers. 


(a) What does “closed” mean in this context? 


The natural numbers, NV, are suitable for counting, and they are closed 
under addition and multiplication, which are essentially methods of count- 
ing. But they are not closed under the inverse operations of subtraction 
and division. Geometrically, this means that they are suitable for measur- 
ing undirected distances equal to an exact number of units of measurement, 
but inadequate for indicating direction or fractional parts of distances. 
Algebraically, it means that they are insufficient for solving equations of 
the form a - « = ὃ whena > ὃ, or of the form ax = ὃ when a does not 
divide b. 

Historically, the inadequacy of natural numbers to represent fractional 
parts was felt first. Accordingly, the positive rationals were invented in 
very ancient times, with the numerical and geometric intepretation in- 
dicated in Section 1-2. There are many ways of introducing the positive 
rationals into a formal theory of numbers. A natural one is to consider 
the rational number m/n as a relation (a ratio) to which (x, y) belong if 
and only if x/y = m/n, that is, zn = my. Formally, 


(1) Def. m/n = {(x,y)|my = nz A (t1,y) EN XN}. 


According to (1) the rational 3/4 consists of all those ordered pairs of 
members of N whose components are in the indicated ratio, that is, (3, 4), 
(6, 8), (9, 12),.... It is then easy to prove 


(2) (a/b = c/d) — (ad = be). 


From this (1-14-2) follows. Then if we define addition and multiplication 
of rationals by (1-14-3) and (1-14-8), and subtraction and division, as 
usual, by (1-14-19) and (1-14—-1), we can prove all the laws concerning 
positive fractions. Finally, if we define < by 


(3) Def. (a/b < c/d) = (ad < be), 


the rules of inequalities can be proved for the positive rationals. 
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(0) Derive (2) from (1). (ὁ) By relying on laws assumed already proved 
for natural numbers and on (2), show from (1-—14—-3) as a definition of addition 
that a/b + c/d = (ad-+ be)/bd. (d) Show similarly that addition of posi- 
tive rationals is commutative. (6) Do the same for multiplication. 


Among the positive rationals Ra‘, let us consider just those of the 
form n/1 where n Ε N, and let us call the set of all such rationals K. 
Note that n/1 = {(n, 1), (2n, 2), (3n, 3),...}, so that n/1 and n are not 
wdentical! However, we note that Καὶ = N, since the function F defined by 
F(n) = n/1 maps N onto K in a one-to-one correspondence. Moreover, 
this correspondence has the following interesting properties. 


(4) Fin + m) = F(n) -+ F(m), 
(5) F(n+m) = F(n)-F(m), 
(6) Fim) < Fin)om « ἢ. 


In words, the image of a sum is the sum of the images, and the image of 
a product is the product of the images. These laws follow immediately 
from the definitions of F and the operations. For example, (πὶ + m) = 


(m+ n)/1 = m/1 + n/1 = F(m) + F(n). 


(f) Similarly justify (5). (g) What is F*? (8) Argue that (4) and (5) 
hold also for F*. (i) Complete the following verbal interpretation of (6): 
The images of two natural numbers are in the relation of less than ; 
(1) Show that the positive rationals are closed under addition, multiplication, 
and division. 


The results of the last two paragraphs may be verbalized by saying that 
there exists between N and K a one-to-one correspondence that preserves 
sums, products, and order. In other words, the natural numbers and the 
positive rationals with denominator 1 behave exactly the same under the 
operations and relations in which we are interested. It follows that in any 
calculation it makes no difference whether we use the symbol n or 7/1. 
It is for this reason that we can safely treat n and n/1 as the same in 
arithmetic. 

The relation existing between N and K above is called isomorphism. 
Two sets are said to be isomorphic with respect to certain operations and 
relations if there exists a one-to-one correspondence between them that 
preserves these operations and relations. More precisely, suppose we have 
a set A on which there is a set Sg of operations and a set Sp of relations. 
Similarly, suppose we have a set A’ with operations SZ and relations 
Sr. Then we say that A and A’ are isomorphic with respect to these 
operations and relations provided there exists a one-to-one correspondence 
F between A U Sg U δὲ and A’ U SZ U Sp that maps a in A into a’ 


900. NUMBERS [cHAP. 6 


in A’, οἴῃ So into ο΄ in SG, and rin Sp into?’ in Sp, and such that (a ο δ)" = 
a’ o’ δ' andarb« a'r’ δ΄ are laws. 

When two sets are isomorphic with respect to certain operations and 
relations, every law about one has a corresponding law in the other. The 
two theories differ only in the symbols used and are formally the same. The 
concept is a powerful one, because it often happens that theories about 
quite different things turn out to be isomorphic. This means that the 
things under consideration do not differ in any expressible way with 
respect to the operations and relations being considered. Hither may be used 
as an accurate picture of the other, and a single formal theory suffices to 
tell us everything about them both as far as these relations and operations 
are concerned. 

We may summarize the discussion of the positive rationals by saying 
that they include a proper subset isomorphic to the natural numbers 
with respect to the operations of addition and multiplication, and the 
relation <. 

The negative numbers and zero were added to the number system only 
very recently. Until a few hundred years ago, a — ὃ was considered 
absurd if a < b, and zero is not mentioned in European manuscripts 
until the fourteenth century, although the Hindus and Arabs had used 
zero before that time. Zero and negative numbers may be introduced by 
definitions analogous to those for the rationals. We may define the dif- 
ference a — b as the relation 


{(z,ylat+y=b+2A (t,y) EN XN}, 


define 0 as a — a, and then write —b for0 — banda fora — 0. We then 
get all the integers J defined as differences of natural numbers. The subset 
J* of positive integers (given by a — 0 with a Ε ΔΝ) is then isomorphic 
to N. 

Having defined the integers J as relations on the natural numbers Ν, 
we may then define rationals as ratios of integers by (1)(N:/) and so 
arrive at the rationals Ra. Or we may define the difference a — ὃ as 
{(xz,ylaty=b+-2A (2,y) € Rat X Rat}. This gives us all 
the rationals as differences of positive rationals. The rationals defined in 
this way are not identical with Ra, but they are isomorphic to Ra. The 
integers and rationals can be introduced in many other ways but, barring 
errors, they will always be isomorphic to the numbers as defined here. 
Also, the subset of those differences a — ὃ for which a > ὃ, is isomorphic 
to Rat. The isomorphisms hold with respect to the arithmetical opera- 
tions and the relation of order. On this account, so long as we are inter- 
ested in only these operations and relations, it makes no difference in 
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which way our numbers have been defined. In the same context we may 
ignore the difference between the natural numbers and the positive 
integers. 


(k) Show from the definition of a — ὃ that ὁ — c = d — ἃ, so that 0 is 
uniquely defined by a — a. _ (1) Define < for integers. (1) Show that the 
set of integers of the form a — 0 with a € N is isomorphic to N with respect 
to addition, multiplication, and <. 


The rationals are closed under addition, multiplication, and their in- 
verses (except division by zero). However, they are not closed under the 
inverse of taking powers, i.e., root extraction. To prove this we need 
just one counterexample: 


(7) There 1s no rational number whose square 18 2. 


(n) Restate (7), using quantifiers and other logical symbols. State it using 
the word “irrational.” 


To show (7) we need to use the fact that a natural number is even if its 
square is even. 


(8) μι ΞΕ - ΕΚ. 


This is easily proved indirectly by assuming ~(m Ε )ὺ. Then m is odd 
and can be written in the form 2n - 1 where n € N. Then m? = 4n? + 
4n +1 = 2(2n? + 2n) - 1, which is clearly odd. We have shown 
~(m Ε E) — ~(m? Ε ΕἾ, which is equivalent to (8). 

Now we show (7) indirectly by assuming its negative. Since (7) = 
~IiminmEN AnEN A (m/n)* = 2, its negation is the assumption 
that there do exist natural numbers m and n such that (m/n)? = 2. 
Moreover, we can assume that m and n are not both even, since a common 
factor of 2 could be divided out by (1-14-2). We have, then, m?/n? = 2 
and m* = 2n”. We see that γι must be even, since it has 2 as a factor. 
Hence by (8) m is even; that is, Jjpp ἘΝ A m= 2p. Replacing m by 
2p, we find (2p)? = 2n? or 2p = n?. It follows that n? is even and by 
(8) that-n is even. But this means that m and n are both even, which con- 
tradicts our assumption. Since the negation of (7) leads to a contradiction, 
it is true by (2—7-43). 

The discovery of (7) is attributed to the Pythagoreans, a society of 
mathematicians which flourished during the sixth century B.C. Since 
one can easily construct a segment whose length is +/2 (the diagonal of a 
square with side 1), the discovery meant that rational numbers are in- 
adequate to represent distances. It was one cause of the divorce of arith- 
metic and geometry, which lasted until very recent times, for the exten- 
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sion of the number system to include irrationals is not as easy as the 
previous extensions. Whereas rationals can be defined as simple relations 
on the natural numbers, it requires infinite sets of rational numbers to 
define a real number. We postpone discussion of this extension to the next 
section. However, its pattern is the same as before: definitions in terms 
of previous numbers, then proof of the laws of algebra. 


(o) Prove that 4/3 is not rational. 


PROBLEMS 


1. Show that the even numbers are closed under addition and multiplication. 

2. Show that the odd numbers are closed under multiplication but not under 
addition. 

3. If we defined rationals as ratios of integers, instead of Just natural num- 
bers, what additional limitation would we have to add to (1)? 

4. Show that multiplication of rationals defined as ratios of natural numbers 
is commutative. ; 

5. Show that the set of vectors (z, y) whose first component is a natural 
number and whose second component is 0 is isomorphic to the natural numbers 
under the operation of addition. 

6. Prove (1-14-2) from (2). 

7. Once negatives and rationals have been introduced, it is natural to try 
to extend the meaning of a” to cases where 7 is any rational number. We do 
so by the following definitions. 


(9) Def. am” = yax™ =a" A [ΠΕ ΚΡ - α 0], 
(10) Def. qn? = 17a". 


Since we have previously defined only non-negative integral exponents, (9) 
applies only to m and πὶ belonging to N or m = 0 and defines non-negative 
rational exponents. Then (10) has meaning for all non-negative rational values 
of n and defines rational exponents. Read (9) in words. What is the range of 
significance of a in (9)? 

8. Use (9) to show that a!/? = v/a by reference to (1-13-16). 

9. Use (9) to show that non-negative rational exponents follow the laws 
(6-6-2) to (6-6-7). 

10. Assuming that the proofs of Problem 9 have been accomplished, use (10) 
to show that rational exponents satisfy these laws. 

11. The number a!/, defined by (9)(m:1), is called the principal nth ΤῊΣ of a 
and is usually designated by a. Any number z such that x” = a is called an 
nth root of a. How many square roots (n = 2) does a non-negative real number 
have? a negative real number? | 

12. Show that a”/" = ὕ απ, 

13. Show that fora > 0, b = 0, 


(11) Vans = Vad; 
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(12) WV q = Vay 
(13) | (Wat = Yea. 


14. Show that (13) is not a law if the limitation on a and ὦ is removed. 
KIS. IfaeECAbECCAaK<bAn~AxECAa<ez « ὃ, wecall a and 
ὃ adjacent members of C, a the immediate predecessor of ὃ, and ὃ the immediate 
successor of a. A set in which every member (except the smallest, if any) has 
an immediate predecessor and every member (except the largest, if any) has 
an immediate successor, is called discrete. Explain why the natural numbers 
are discrete. 

* 16. Show that the rationals are not discrete. 

* 17. Show that the negative integers are discrete. 

* 18. Show that any subset of a discrete set is discrete. 

* 19. A set in which there are no adjacent members is called dense. Show that 
the rationals are dense. 

*% 20. Show that we can find a rational number arbitrarily close to any given 
rational number. 


ANSWERS TO EXERCISES 


(a) See Section 5-5. (Ὁ) a/b consists of all pairs (a, y) such that ay = bz, 
and c/d consists of those for which cy = dz. If a/b = c/d, then these sets must 
be the same. That is, (x, y) satisfies ay = bz if and only if it satisfies cy = dz. 
Alternatively, we note first that (m,n) Ε m/n by (1), since mn = nm. Hence 
if a/b = c/d, then (c,d) Ε a/b or ch = da. Conversely, if ad = be, then 
(x, y) € a/b > ay = bx «» acdy = bedx ἐ cy = dx «Ὁ» (x,y) Ε c/d, so that 
a/b = c/d by (3-3-4). (c) a/b + c/d = ad/bd + cb/bd = (ad + cb)/bd. 
(d) Since ad-+ cb = οὖ - ad by commutativity of addition of natural num- 
bers. (e) Using the commutativity of multiplication for natural numbers and 
the definition (1-14-8), (a/b)(c/d) = ac/bd = ca/db = (c/d)(a/b). 

(ἢ F(nm) = nm/1 = (n/1)(m/1) = F(n)F(m). (g) F*(n/1) = n. 
(8) Since the correspondence is one-to-one and (4) and (5) hold for F. (i) if 
and only if the corresponding natural numbers are in the same relation. 
(j) Since in each case the definition yields a ratio of positive natural numbers. 
(k)aty=ateoya=rteQnrbty =b72. Gd) [a —b «-ς -- ἃ] = 
fa+d<b+ cl. 

(m) F(a) Ξ α --ΑΟ. F(a+b) =a+b—-0=a—-0+6—-0 = F(a)+ Fi). 
(n) ~ dxx Ε Ra A x? = 2. V2 is irrational. 

(0) V3 = m/n > [832 = m2] - ὁ. [Ipp EN A m = 8p] > 3n? = 9p? > 
n? = 3p? > Add ENA ἢ = 84. 


ANSWERS TO PROBLEMS 
1. 2m + 2n = 2m-+ n), (2Qm)(2n) = 2(Qmn). 8. n τέ Ὁ. 5. Since (a, 0) + 
(x’,0) = (x + 2’, 0), F(x) = (2, 0). 7. Non-negative reals when ἢ is even, reals 
when n is odd if we wish x to be real. Of course, if we were being thoroughly 
rigorous, α would be restricted to rational values since real numbers have not 
been defined as yet. 
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6-9 Real numbers. There are many ways in which the real numbers 
can be defined satisfactorily. All methods have the following features in 
common. 


(1) The real numbers are defined in terms of rational numbers. 


(2) The arithmetic operations and the relation of inequality are 
defined in terms of these relations for rational numbers. 
With these definitions, the real numbers are proved to satisfy 
(3) the fundamental algebraic identities (1-16-1) through (1-16-10) 
and the laws of inequalities of Section 3-5. 


Included in the real numbers 1s a subset tsomorphic to the 
rationals. 


(4) 


(5) The real numbers are tsomorphic to the points on a straight 
line with respect to the operations of arithmetic and order. 

(a) If you did not study Section 6-8, read now the paragraphs on isomorphism 
following Exercise (e). (0) Why does it follow from (4) that the reals in- 
clude subsets isomorphic to the integers, natural numbers, evens, etc.? (c) What 
is the one-to-one correspondence under which (5) holds? What operations on 
points correspond to the operations of arithmetic? What is the image of <? 


To carry out (1) through (5) in detail is much too time-consuming for 
our purpose here. Instead we mention some ways in which the definitions 
can be formulated. In Section 1-15 we described real numbers in terms 
of infinite decimals, an idea which can be used to formulate rigorous 
definitions and proofs. For details the reader is referred to the first pages 
of The Theory of Functions, by J. F. Ritt. The real numbers may be de- 
fined as cuts of rationals. A cut is defined in Edmund Landau’s Founda- 
tions of Analysis as a subset of rationals that contains at least one but not 
all the rationals, that contains no greatest member, and every one of 
whose members is less than every rational not belonging to it. Speaking 
in rough geometric terms, a cut is the set of all rationals to the left of a 
given point on the axis, and the point (real number) is defined by speci- 
fying the cut. For details, the reader is referred to Chapter III of Landau’s 
book. Also, real numbers may be defined in terms of sets of nested inter- 
vals. A sequence of nested intervals is defined as a sequence of closed inter- 
vals I,, each consisting of the rational numbers between two rationals, 
each one included in the previous one, and the length of the intervals ap- 
proaching zero as ἢ increases. An informal discussion of this approach 
may be found in Chapter II of What is Mathematics?, by R. Courant and 
H. Robbins. 
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(4) Describe an infinite decimal as a special kind of sequence. (06) Review 
Section 1-15 and suggest a connection between the definitions in terms of 
nested intervals and in terms of infinite decimals. (ἢ) Suggest a definition 
of < for infinite decimals. (9) Show that {112 < 2A 2€ Ra} is a cut. 


It is not hard to show that between any two rational numbers there is 
another, since if a and ὃ are rational so is (a + b)/2. It follows that there 
are an infinite number of rationals between any two of them. We de- 
scribe this property by saying that the rationals are dense. 


(h) Why does a € Ra A bE Ra — (a-+ δ), Ε Ra? (i) Show that 
a<b—-a< (a+ 6)/2 <b. (j) Show that the number of rationals be- 
tween two rationals is not finite (1.6., is infinite). (k) Show that our definition 
of “dense” here is equivalent to that given in Problem 19 of Section 6-8. 


The property of denseness of the rationals means that they are very 
“tightly packed” on the line. On the other hand, from Problem 21 in 
Section 6-2 we know that in some sense there are “more” reals than 
rationals, so that the real numbers must be even more tightly packed on 
the line than the rationals! We close our sketch of the real numbers by 
discussing this idea more carefully. 

We call x an upper bound of a set B if all members of B are no greater 
than x; that is, Vzz € Β -ος < x. Similarly we call y a lower bound of 
BitVzezEB—oz> y. Aset that has an upper (lower) bound 15 said to 
be bounded from above (below). One that has both is said to be bounded. 
One that has neither is said to be unbounded. 


(1) Give examples of each category. 


A bounded set has infinitely many bounds, since if zx is an upper bound, 
any number greater than x is an upper bound, and similarly for lower 
bounds. For example, the set of all numbers between 0 and 1 is bounded. 
Among its upper bounds are 1, 2, 1.001, “2, 1000, and 1.3. Among its 
lower bounds are —99, —0.00001, and 0. Is there a smallest among its 
upper bounds? Yes, 1 is the smallest upper bound, for if x < 1, then the 
denseness of the reals implies that there is a number between x and 1, 
and hence a member of the set greater than +. Hence x cannot be an 
upper bound. 

If there exists a number that is smaller than all other upper bounds of a 
set, we call it the least upper bound (lub). Similarly, the greatest lower 
bound (glb) is the largest of the lower bounds. I‘or the open interval be- 
tween 0 and 1, the lub is 1 and the glib is 0. Note that in this case they 
are not members of the set. 

Let us consider the set A of positive rationals whose squares are less 
than 2, A = {x!/a? <2 A2€RaAx> 0}. This set is certainly 
bounded from above. Among its upper bounds are 2, 3, 2.4, 1.5, 1.42, 
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1.415, ete. But does it have a least upper bound? Not if we limit our- 
selves to rational numbers, for if x is any rational greater than +/2, we 
find another rational between x and +/2. Hence A has no least upper 
bound among the rationals. However, if we include all reals, A has the 
least upper bound «2. 


Find the lub and glb, if any, of the following: (ἢ) N, (ἢ) J, (0) Ret, 
(p) {a]a? « 4), (4) {xl2? « 8), (ἡ) {z|2? < 3} 


We have seen that unless we include the irrational points on the line 
it may happen that a bounded set has no least upper or greatest lower 
bound. However, if the irrationals are included, then it can be proved 
from the definitions of real numbers that 


(6) Every set of real numbers that 1s bounded from above (below) 
has a lub (glb) that 18 a real number. 

A set B such that every subset bounded from above (below) has a lub 
(glb) that is a member of B is called continuous. The real numbers are 
continuous, whereas the rationals are not. The real numbers are sometimes 
called the continuum. It is the continuousness of the reals that enables 
them to completely fill in the line without any Jumps (such as exists be- 
tween the integers) or gaps (such as exist among the rationals). 


(s) Argue that every real number is the lub of the set of all the approxima- 
tions s; formed by replacing all digits after the 7th by zeros in its decimal ex- 
pansion. Note that this defines each real as the lub of a set of rationals—an 
approach alternative to infinite decimals. (Ὁ) Show that every non-empty 
subset of N contains its own glb. %*(u) Prove from (6) that there is a real 
number whose square is 2. 


The continuousness of the real numbers, expressed by (6), is useful in 
many ways. As an illustration, it enables us to extend the definitions of 
exponents to all real values. When a > 0, αὖ is defined for all rational 
values of x by (6-8-9) and (6-8-10). Then we can define αὐ for any real 
number ¢ as the least upper bound of all the numbers αὖ for rational 
values of « < c; that 18, 


(7) Def. α΄ = lub {z|Jra2 EE Ra Nx «ολ 24- a’*}. 


Roughly speaking, a° is a real number that is approximated arbitrarily 
closely from below by taking rational powers with exponents smaller 
than c. With such a definition all real exponents can be proved to follow 
the familiar laws. 


(v) Suggest a definition of the product of two reals in terms of least upper 
bounds. 
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PROJECTS 


1. Study the discussion of the number system in Chapter I, sections 1 and 
2, of What is Mathematics?, by R. Courant and H. Robbins. 

2. Study the theory of infinite decimals in The Theory of Functions, by J. F. 
Ritt. Try to reformulate the discussion as symbolically as possible. (This 
reference is more difficult than that of Project 1.) 

3. Study Edmund Landau’s book, Foundations of Analysis. (This is suitable 
for a term project.) 

4. Study Theory of Sets, by ἘΞ. Kamke (Dover, 1950), for further details 
on continuity, denseness, etc. 


ANSWERS TO EXERCISES 


(b) Because the rationals include such subsets, and hence any isomorphic 
set must also have such subsets. (0) F(a) = the point corresponding to z. 
Dom (F) = real numbers, Rge (F) = axis. The image of < is the relation 
(to the left of). (d) F(0) = the integral part, F(1) = the first digit after the 
decimal point, F(t) = the ith digit. (6) The infinite decimal specifies a 
sequence of nested intervals in which the ith has length 10—*. (ἢ The greater 
is the one that has a greater number in the first position in which they differ. 
(g) A subset of Ra, since z © Ra is part of the defining sentence; non-empty, 
since 0? < 2; does not contain all Ra, since 37 > 2; contains no greatest 
member, since we can always find a rational in any interval; and if y* > 2 
and 22 < 2, certainly z < y. (8) Because rationals are closed under addition 
and division. (1) a+ (6 — a)/2 = (a+ b)/2 = ὃ — (ὃ — a)/2. 

(j) Suppose the number is finite, then choose the largest. There will be 
another between it and the bigger of the two given rationals, thus contradicting 
the assumption. (k) No adjacent members means that there is always an- 
cther between any two. (τα) No lub, glb = 1. (n) Not bounded. (0) No 
lub, glb = 0. (p) lub = 2, glb = —2., (q) lub = V3, glb = —V3. 
(r) Same as (q). (8) See Section 1-15. (0) See Problem 3 in Section 6-5. 
(v) The least upper bound of the set of all products of rationals less, respec- 
tively, than the two real factors. 


* 6-10 Systems of numeration. Our decimal system of expressing num- 
bers is called a positional system, because the meaning of each digit in a 
numeral depends on its position relative to the others. Thus in 9375, 9 
stands for 9000, 3 for 300, 7 for 70, 5 for 5, and 9375 for the sum of these. 


(1) 9375 = 9. 108 + 3-10? + 7-10! + δ. 10°. 


The number 10 is called the base of the system. Every number is expressed 
as a sum of terms each of which is the product of a number less than the 
base times a power of the base. 

Different civilizations have had distinctive systems of numeration, and 
it is only recently that the decimal system, based on the number 10, has 
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come into general world-wide use. There is nothing essential about the 
use of 10. Other civilizations have used other bases, or even no base at 
all, and 10 is not the best base for all purposes. 

The system of numeration using the base 2, called the binary system, 
requires just two digits, 0 and 1. It has great practical importance today 
because it is the system used by most of the new electronic “brains.” 

To distinguish numbers written in the binary system we place the sub- 
script b after binary numerals whenever they appear with common deci- 
mals. Then we have | 


l,= 1 101, = 5 1001, = 9 1101, = 18 

10, = 2 110, = 6 1010, = 10 1110, = 14 

o li, = 3 11 ef 1011, = 11 1111, = 15 
100, = 4 1000, = 8 1100, = 12 10000, = 16 


Note that 1, = 2°, 10, = 2', 100, = 27, 1000, = 2°, and 10000, = 
24 justas1 = 10°,10 = 10!, 100 = 107, 1000 = 10°, and 10000 = 105. 
Note also that any binary numeral expresses a number as a sum of terms 
each of which is either 0 or 1 times a power of 2. For example, 


(3) 101101, = 1-2°+0-2*+1-2%?+1-2?+0-2'+1-2° 


Write as in (3) and then evaluate in the common decimal notation: (a) 10], 
(b) 10015, (ce) 10001,, (4) 100001,, (6) 1010114. 
Express in the binary notation: (f) 9, (6) 17, (h) 23, () 84, (j) 50. 


The reader may wonder why it is advantageous to use a notation in 
which so many digits have to be written to represent a number. The answer 
lies in the following addition and multiplication tables for the binary 
system. 

Addition Mutitplication 


(4) 


With the use of the tables, binary arithmetic follows the usual rules. 
Since (4) is so simple, very little strain is put on the memory and the opera- 
tions are very easy. This is particularly important for electronic brains, 
since they perform routine operations very quickly, but are relatively 
poor in memory and lack initiative. To add two binary numerals, we 
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place them in the usual way. Then each column adds to zero or one, with 
one carried when both digits in the column are 1’s. Tor example, 


1011011 
(5) + 101110101 


111010000 


Multiplication is even easier, since to multiply by 1 we simply recopy and 
to multiply by 0 we do not recopy! For example, 


1011101 
x 10110 


(6) 1011101 
1011101 
1011101 


11111111110 


Perform the following operations, where all numerals are binary, by using (4) 
and the usual rules of arithmetic: (k) 101111 -+ 101001, (1) 1011 - 1011 4+ 
10101, (m) 11101 — 1011, (m) 11011-11001, (0) 11011/11. 


Decimals less than 1 can be represented in the binary system as easily 
as in the decimal system. For example, 1.101, = 1+ 1.2 - 1/8 = 
1+ 27* 4 27%, 

(0) Find in the usual decimal notation 11.011, and 0.101011,. (4) Find, 


by using the binary notation only, (1.101,)(0.101011,) and check your result 
by changing to the usual decimal notation. 


The number system based on twelve is called the duodecimal system. 
It requires two additional digits. Let eg = 10, fg = 11, where the sub- 
script d refers to duodecimal numerals. 


(r) Construct addition and multiplication tables for duodecimal arithmetic. 
(s) Transform to common decimals: 114, 234, 380. (t) Transform to duo- 
decimals: 18, 95, 144. 


PROBLEMS 


1. Find the decimal expansion in the duodecimal system of 1/3, 1/4, 1/6. 

2. Can you think of any advantages of the duodecimal system over the 
decimal system? 

3. Investigate the system with base 8. Make addition and multiplication 
tables, transform numbers between this system and the common decimal 
system, do sample computations, and evaluate fractions. 

4, Could we represent all real numbers as binary decimals? Would this 
simplify the arguments ‘of Section 1-15? Carry through the argument of that 
section with this change. | 
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5. Show that if one carries out the computation of Problem 11 in Section 
1-10, the last two digits in the final answer give the change in one’s pocket and 
the first two one’s age. (Suggestion: Let A = 105. - y, C = 10a-+ ὃ.) 

6. Why do you suppose that the binary system is convenient for machines 
that operate electrically? (Hint: The current is either off or on in a circuit, so 
one could represent 0 and 1 by....) 

* 7. On the face of a nuclear scaler used to count cosmic radiation particles, 
there appears a row of eight little lights labeled 1, 2, 4, 8, 16, 32, 64, 128. When 
the machine is operating, the lights flash on and off. Immediately after all the 
lights are on at once, they all go off at once, and at the same time the number 
showing in a little window increases by one. Then the lights begin flashing, 
seeming to move from left to right. What is the explanation? 

*8. Use simple examples to justify the usual procedures for doing arithmetic 
by reference to laws of algebra. 

*9. Show that a number is divisible by four if and only if the number repre- 
sented by its last two digits in the decimal system is divisible by four. 

* 10. Show that a number leaves the same remainder when divided by nine 
as does the sum of its digits when written in decimal notation. 
*% 11. Show the same for a number divided by eleven. 


ANSWERS TO EXERCISES 
(a) 5. (b) 9. (ce) 17. (ὦ 38. (e) 485. (Ὁ 1001. (Ὁ) ΙΟ00Ι. 
(h) 10111. (i) 100010. (j) 110010. (k) 1011000. (1) 101011. 
(m) 10010. (n) 1010100011. (0) 1001. (p) 27/8 and 43/64. (9) 13, 27, 
36. (0) lla, 7fa, 100z. 


ANSWERS TO PROBLEMS 
1. 0.44, 0.34, 0.24. 2. More simple fractions are terminating decimals. 4. Each 


interval would be 1/2 the size of the preceding one. 5. 50(2(10x + y) + 5) + 
(10a + b) — 365-4 115 = 1000z + 100y + 10a + ὃ. 


* 6-11 Approximation. We assume that the reader is able to carry out 
arithmetic operations with decimals. Effective numerical calculation in- 
volves more than this. Indeed, because of the power of modern calculating 
machines, the ability to do arithmetic is less important than an under- 
standing of the nature of arithmetical operations and the errors inevitably 
involved. Error is an essential part of most numerical work for several 
reasons. In the first place, in practical computations we are usually work- 
ing with numbers obtained by measurements that cannot be exact. Sec- 
ondly, we have seen that most numbers cannot be expressed exactly by 
terminating decimals, and irrational numbers cannot be expressed as 
repeating decimals or ratios of integers. Thirdly, the capacities of machines 
and the time and patience of human beings set limits to the accuracy of 
our work. Even if we knew two numbers exactly*to 100 decimal places, 
we would not care to find their product exactly, since this would require 
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200 decimal places! The purpose of this section is to familiarize the reader 
with standard terminology for talking about approximation. 


Errors. Suppose A is an approximation to the true value 7’. Then we 
define the error E by 


(1) E= 4A —T. 
For example, let A = 0.67 and T = 2/3 = 0.6. Then E = 0.67 — 
0.6 = 0.67 — 0.666... = 0.00333... = 0.003. 

(a) Complete: The error is the _______ minus the _______.._ The error 
is negative θη. 6Ὸϑ, (0) Verify the result in the example in the previ- 


ous paragraph by calculating # = 67/100 — 2/3. 
The relative error RF is defined by 


(2) R= E/T. 
(c) Complete: the relative error is the _____. divided by 6. 
or the ratio of the _________ to the _______.._ (d) What is the relative error 


when 0.26 is used as an approximation to 0.264? 


Precision. Usually we do not know the error when we are using an ap- 
proximation, but we almost always have some idea of how large it may be. 
We may know, for example, that the digits in a decimal are reliable from 
the first digit on the left up to a certain one, but unreliable after that. 
If the last reliable digit is to the left of the decimal point, we say that the 
number is precise to units, tens, hundreds, etc., according to the position 
of the last reliable digit. If the last reliable digit is the nth one at the right 
of the decimal point, we say that the number is precise to n decimal places. 
When we say that a certain digit is reliable, we mean that the absolute 
value of the error is not greater than the number represented by 5 in the 
next decimal place. We use the word precision to refer to reliability 
measured in this way. : 


(e) How large could the error be if a decimal is known to three decimal 
places? (ἢ) if it is known to units? (6) if it is known to hundreds? 


Accuracy. The reliable digits in a decimal, not including initial zeros 
used to indicate the position of the decimal point, are called significant 
digits. For example, there are 4 significant digits in 0.03105, if we assume 
that all digits are reliable. The reliability of a decimal may be indicated 
by giving the number of significant digits. We use accuracy to refer to 
reliability measured in this way. 


(h) Suppose 3.149 is accurate to three significant digits. Find the possible 
error and possible relative error. (i) Do the same for 31.49. 
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Rounding off. It is often desirable to round off a decimal either to elimi- 
nate unreliable digits or to simplify calculations. The following rule is 
generally used. 


If the digits to be dropped represent less than half a unit in the 
last digit retained, drop them and make no other change. If they 
represent more than half a unit, drop them and increase the last 
retained digit by one. If they represent exactly half a unit, the 
last retained digit is left unchanged 1 tt 1s even and increased by 
one 1} τὶ ts odd. 


(3) 


In brief, round off to the closer approximation, choosing the even possi- 
bility if they are equally close. For example, we round off 3.14159 to five 
significant digits (four decimal places) by writing 3.1416, which is closer 
to 3.14159 than is 3.1415. But we round off 3.1415 to 3.142, since 3.141 
and 3.142 are equally close. 


(j) Round off to 3 significant digits: 3.14159; 0.03841; 99.47; 99.45; 37,251; 
63140. 


Scientific notation. It is customary in science not to write unreliable 
digits, except sometimes during calculations. When this convention is 
followed, we know that all digits are reliable. But if we try to round off a 
decimal accurate to, say, hundreds, we find that we must write unreliable 
digits in the usual notation. For example, suppose 582,937 is accurate 
only to hundreds. We may round it off to 582,900, but we must keep the 
zeros even though they are unreliable. The difficulty is avoided if we shift 
the decimal point until it lies to the right of the first significant digit 
(i.e., the first nonzero digit from the left), and compensate by multiplying 
by some power of 10. Thus 582,937 = 58293.7 Χ 10 = 5829.37 x 107 = 
582.937 Χ 10° = 5.82937 x 10°. Now we round off to 5.829 x 10°, 
where we no longer need the zeros. 

A rule for writing numerals in this way is: 


Move the decimal point to the position immediately to the right 
of the first significant digit and multiply by 10”, where n 18 the 
number of places from this position to the original position of the 
decimal point, counting positive to the right, negative to the left. 


(4) 


Numerals expressed in this way are said to be In sctentific notation. 


(k) Write in scientific notation: 48.2; 951.6; 2.97; 0.199; 0.03846; 30.016; 
0.00074; 3,001,309; 1,000,000,000. (1) Write in the usual decimal notation: 
8.91 Χ 107, 3 Χ 10°, —4.397 Χ 103, 1.31 X 107!, 4.985 Χ 10~°. 


The scientific notation has other advantages. It saves space in writing 
decimals representing very large or very small numbers. It permits the 
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quick comparison of numbers. It indicates at a glance the accuracy of a 
decimal, since the number of digits that appear is the number of significant 
digits. 

Order of magnitude. If we express a decimal in scientific notation in the 
form A Χ 10”, then replace A by 1 or 10 according to whether A < 5 
or A > 5, the result is a power of ten that we call the order of magnitude 
of the original number. It is roughly the nearest power of ten. For exam- 
ple, the order of magnitude of 4.92 is 1, of 5.78 is 10, of 68 1s 100, of 0.3 
is 10~?, ete. 

Possible errors. Suppose that a decimal (not in scientific notation) has 
been rounded off to n decimal places and is known to be reliable to the nth 
place. Then the absolute value of the error cannot exceed 1/2 unit in the 
nth place or 5 in the next place to the right. Thus precision gives us an 
upper bound to possible absolute values of the error, which we call the 
possible error. 

Suppose that a decimal has been rounded off to n significant digits, 
where all are known to be reliable. We cannot estimate the error, since 
the number of significant digits does not tell us about the position of the 
decimal point. But we can estimate the relative error. Consider 3.19, 
where we assume all digits significant. The possible error is 0.005, the pos- 
sible relative error is 0.005/3.19 = 5/3190 = 0.016 = 1.6 x 107°. 
For 31.9 we have 0.05/31.9 = 5/3190, for 319 we have 0.5/319 = 5/3190, 
for 0.319 we have 0.0005/0.319 = 5/3190, and so on. The possible rela- 
tive error evidently is independent of the position of the decimal point. 
Hence, to get at the relation between significant digits and possible rela- 
tive error, we need consider only numbers between 1 and 10. 

Consider the following examples: 


(5) Decimal 1 2 5 9 10 
(6) Number of significant digits i 1 1 1 2 
(7) Possible error 0.5 0.5 0.5 0.5 0.5 
(8) Possible relative error 

(approximate) 0.5 025 ΟἹ 0.06 0.05 


(9) Order of magnitude of (8) 10-' 107' 107' 107! 107? 


It appears that any decimal accurate to one significant digit has a pos- 
sible relative error whose order of magnitude is 107’. For two significant 
digits the relative errors would be 1/10 as great, and so on. Hence we 
may state that if a decimal has n significant digits, the order of magnitude 
of its possible relative error is 10~”. 


(m) Find the order of magnitude of the possible relative errors of 10, 20, 
30, 50, 70, 90, 12, 99, 100, assuming two significant digits. (n) Precision 
gives us an estimate of the __..___, accuracy an estimate of the 
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PROBLEMS 


In Problems 1 through 10 an approximation and true value are given, in 
that order. In each case find the error, the relative error, the precision, and the 
accuracy. Round off so that all digits are significant, and express in scientific 
notation. 


1. 3.14; 3.13. 2. 31.4; 31.8. 

3. 0.314; 0.313. 4. 3,339,412; 3,000,000. 
5. 256; 259. 6. 100.01; 100.05. 

7. 0.0003818; 0.0003812. 8. 20, 10. 

9. 10, 20. 10. 0.001; 0. 


11. Express in scientific notation: 8.912; 81.92; 0.8192; 819.2; 0.08192; 
0.00008192; 1,300,000; 750,000,000; 1,000,000,000; 0.100,000,000; 8.100,000,001 ; 
358.2 X 1078. 

12. Express without scientific notation: 3 Χ 10; 3.48 X 107; 3.48 x 107; 
3X 10-8; 1.8901 Χ 10-3; 10.000195 Χ 107; 3.00123 Χ 1019, 

%* 13. What is the order of magnitude of the U.S. national debt? 
w%14. Of the U.S. population? 

*15. Of the population of China? 

% 16. Of the average family income in the U. 5. 

* 17. Of the prison population in the U.8.? 

18. Find the order of magnitude of 16.35 - 102, 3,895,113, and 0.00345. 

%* 19. Argue that the order of magnitude of the ratio of two numbers is ap- 
proximately the ratio of their orders of magnitude, and illustrate with examples. 
% 20. Argue by examples that the order of magnitude of the difference of two 
numbers is not equal to the difference of their orders of magnitude. 

*% 21. Show that there is one number for which “order of magnitude” has no 
meaning. 


ANSWERS TO EXERCISES 


(a) approximation; true value. the approximation is less than the true 
value. (b) 1/300. (c) error; true value; error; true value. (ἃ) —1/66 
or approximately 0.017. (6) 0.0005. (ὃ) 0.5. (g) 50. (8) 0.005; 5/3144 
or approximately 0.0016. (i) 0.05; same as in Exercise (ἢ). (7) 3.14; 0.0384; 
99.5; 99.4: 37,300; 63100. (Κ) 4.82 Χ 10; 9.516 Χ 107; 2.97 Χ 10°; 1.99 x 
10-1; 3.846 X 10-2; 3.0016 X 10; 7.4 X 1074; 3.001309 Χ 10°; 10%. ( 891; 
3,000,000; —4397; 0.13 1;0.000004985. (τα) All 10- 2. (n) error; relative 
error. 


ANSWERS TO PROBLEMS 


1. 0.01; approx. 0.0032; precision is to 1 decimal place; accuracy is to two 
significant figures; 3.1; 3.1 X 100. 3. 0.001; approx. 0.0032; precision is to 2 
decimal places; accuracy is to two significant figures; 3.1 1071. 5. —3; 
approx. 0.012; tens; two significant digits; 2.6 X 107. 7. 0.0000006; approx. 
0.0016; 6 decimal places; three significant digits; 3.81 x 10-4. 9. —10; —0.5; 
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hundreds; order of magnitude is all that is known. 11. 8.912 * 10°;8.192 Χ 10!; 
8.192 X 107}, ete. 13. 1011, Note that all we need here is to know that it is 
in the hundreds of billions and less than 500 billion. 15. 108 or 109. 17. 10° is 
a good guess. 


* 6-12 Numerical calculations. We are concerned in this section with 
the problems that arise as a result of the fact that the numbers we usually 
work with in applied mathematics are approximations. Suppose for ex- 
ample that 7 = 7,+ Te, T, = 2.0 = Ay, 7. = 10.12 = Ag. As- 
suming that the digits written are reliable, which is the usual convention 
followed, how reliable is the sum A, + Ag = 12.12 as an approximation 
to Τ᾽ We have 1.95 < 7, < 2.05, 10.115 < 7. < 10.125, from which 
it follows that 12.065 < 7 < 12.175. 


(a) Why? 


We see that in 12.12, the second digit after the decimal point is certainly 
not reliable. At most we can say that the result is very nearly reliable to 
one decimal place. 

Let us try to give a more general answer to questions of this kind. We 
have £, = A; — 7,, Ho = Ag — Ts, E=A-—T= (A; + 49) -- 
(Ty + Te) = (4, — 11) + (42 — Te) = δὶ + 5. That 15, 


(1) The error in a sum 18 the sum of the errors. 


Suppose now that we are adding two numbers, one of which is precise 
to more decimal places than the other. The possible error in the less precise 
number is at least ten times that of the more precise number. Accordingly, 
we could not expect the sum to be more precise than the number of lesser 
precision. If both errors are positive, as in the case above, the sum may 
actually not have even the precision of the least precise number. However, 
usually we do not know the actual errors. They may reinforce or cancel 
one another in any particular case. Evidently, in estimating the precision 
of a sum there is bound to be uncertainty. However, considerable ex- 
perience and theoretical considerations too advanced to be considered here 
suggest the following rule of thumb. 


(2) The precision of a sum is that of the least precise of the numbers 
being added. 


This rule is applied no matter how many numbers are added, and it in- 
cludes subtraction. 


(b) Do you think rule (2) would be more or less satisfactory when applied 
to more than two numbers? (c) Why does (2) ‘apply also to subtractions? 
(ἃ) Suppose 71 = 3.965, Te = 134.20813, Τῷ = 0.0295. Find the exact sum 
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and round it off to units. Round off each number to units and find the sum. 
To what position do they agree? (6) Round off to one decimal place and 
add. Compare the result with the sum rounded off to one decimal] place. (f) Do 
the same for two decimal places. (g) Round off the first to units and add to 
the other two. To what place does the result agree with the true sum? 
(h) Round off the second to units and add to T; and T3. Compare with the 
true sum, 


Can we construct a similar rule of thumb to cover multiplication and di- 
vision? We have Καὶ = 4,4ς — 1T1T2, but there seems to be no easy way 
to relate this to the errors 4: — 7; and 4.9 — 72. However, we may 
introduce the errors by noting that 4) = 7, + &, and Ag = 7.2 + Eo. 
Then Καὶ = (T, + £))(T2 + Ee) — Τί 7 = EyT2 + Bel, + Bike. We 
now have the errors in the equation, but it is still hard to see what the re- 
lation is. However, if we divide both members by 7 = 17,7», we find 
E/T = E,/T, + Eeo/Te + (E1/T1)(2/T2). Now we see that the 
relative error in the product is the sum of the relative errors plus their 
product. But if the relative errors are small, the additional term will be 
of small consequence. Accordingly, 


The relative error in a product 1s approximately the sum of 
the relative errors. 


(3) 


(i) Suppose £,/T; = 0.01 and E2/T2 = 0.04. Find #/T exactly and com- 
pare it with the approximate relative error given by (9). 


As in the case of addition, errors may reinforce or cancel each other in 
multiplications. Since each additional significant digit reduces the pos- 
sible relative error by a factor of 10, it appears likely that if several numbers 
are multiplied 


The accuracy of a product is that of the least accurate of the 
numbers being multiplied. 


(4) 


Rules (2) and (4) serve as satisfactory rough guides as to what extent 
decimals should be rounded off during computations and in final answers. 
It is desirable to round off decimals as much as possible before computa- 
tions to avoid needless work, and if answers are not rounded properly, we 
may either lose information or give an illusion of greater accuracy than 
exists. Frequently computers keep one more digit during computations 
than is required by (2) and (4), then round it off in the final result. Such 
a practice is a hedge against possible reinforcement of small errors. The 
reader should note that precision (number of decimal places) apples to 
addition and subtraction, while accuracy (number of significant digits) 
applies to multiplications and divisions. When both are involved, it may 
be desirable to keep more digits than would be indicated otherwise. 
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Often we desire a rough estimate of the result of a numerical computa- 
tion, either as a check, or to locate the decimal point in a result, or simply 
because we are interested only in getting a rough idea of the answer. 
Consider for example, 


(5) A = (848.31) (14.831) !/?(0.0183) /(25.167). 


About how big is the result? ΤῸ make an estimate, we round off the num- 
bers, making use of scientific notation, to the number of significant digits 
in which we are interested, and then carry out the arithmetic, rounding off 
at each step. Thus, rounding off to one significant digit, we find 


(6) A=8xX 10? x 15"? x 2x 10°7/3 x 10 
(7) = 8 xX 4x2 x3 K107 ἘΞ 10". 


Here to find 8 xX 4 X 2 x 37! we thought “32, 64, 21, 20.” 
The order of magnitude of A is 1, and 2 is an accurate estimate to one 
significant digit, since A = 2.3755 to 5 significant digits. 


Estimate the following to one significant digit and compare with complete 
answer properly rounded off: (j) (2894)(0.0139), (Κ) (18.35)(—0.003)?. 


There are many aids and short cuts in numerical calculations. Among 
these are logarithms, the slide rule, desk computers, and electronic com- 
puters. All but the last are accessible to most students and suitable for 
problems of modest complication. Tables and handbooks give simple 
instructions for using such devices, and we shall not discuss them here. 
However, the rules about reliability and the methods of rough approxima- 
tion described above are helpful when these devices are used. The rough 
methods are used for estimation and checking. 


If we are to calculate A = (0.39517) (4.865) (.00913)(3835)(1.8), () how 
many significant digits should appear in the final result? (m) How many 
significant digits should be carried in the computations? (n) Rewrite the 
factors rounded-off as you would round them before beginning computation. 
(o) Rewrite A with the factors rounded to one digit. (p) What is the order 
of magnitude of A? (q) If the numbers were to be added, how many decimal 
places would you keep in the result? (Ὁ) how many significant digits? (5) 
Round off the numbers as you would before adding. 


PROBLEMS 


In Problems 1 through 6 estimate to one significant digit and compare with 
the complete rounded answer. 
1. ((887.19) (2.8321) (0.489))!/3. 2, (285)4. 
3. 51.88% of 4,892,831.16. 4, (1.01)!9. 
5. (253.8) (0.9514) /(5.9385). 
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6. 2100,000,000 (the number of different states of the human eye’s 100,000,000 
photoreceptors). 

7. Estimate the result in Problem 5 to two significant digits and compare 
with the complete answer. 

8. The approximation 1/(1-+ x) = 1 — 2 is sometimes used. What is 
the error? Under what conditions will it be small? Show that the relative 
error is —2?. 

9, Discuss the approximation (1 -- «)7 (1 + y) =1+2—y. 

10. Discuss the approximation (1 + z)!/? = 1 -Ὁ 2/2. When is it good? 

11. To calculate (50.0004/49.9993)!/7, a chemist rewrote the expression as 
follows: ((50 + 0.0004) /(50 — 0.0007))1/2; ((1 + 0.000008)/(1 — 0.000014))?/?; 
(1.000022)!/2; 1.000011. Explain the manipulations. 

%*12. The human egg has a diameter of about 1/2000 inch. How many could 
be put in a quart bottle without crowding? 

* 13. The light-year is the distance traveled by light in one year. If the speed 
of light is 186,339 mi/sec, find the number of miles in a light year to one sig- 
nificant digit. 

%14. Find the following ratios to two significant digits: 81/80; 125/128; 
531441/524288. (For their significance, see Introduction to Musicology, by 
Glen Haydon, p. 33) 

#15. When is 1 + 2a a good approximation for (1 +- a)*? What is the error? 
the relative error? 

*16. Assume that one digit occupies } inch. How long a strip of paper is re- 
quired to write a decimal with 10100 digits? Suppose that it takes 3 sec to write 
a digit. How long would it take to write it out? Compare the length of the 
strip with the distance to the sun (93,000,000 miles) and the time to the age 
of the earth (about 3 billion years). 

*17. In Statistical Methods, by G. W. Snedecor, the following equation appears 
on page 203: 1,310xz? — 113,303z? + 20,540,068z — 317,722,720 = 0. He 
writes, “A first approximation may be got by ignoring the first two terms.” 
Find the approximation. 

* 18. With the aid of a figure, show that when @ = 0, (sin #)/@ = 1. 

*19. The following equations appeared in a paper on the variability of awards 
in damage cases tried by juries and by judges: 12(1.3574)/n = 1.625; 2.828/m = 
1.3574. Solve for m and n to the nearest integer. 

#20. A. A. Eddington’s “cosmical number,” which he believed to be the num- 
ber of protons and electrons in the universe, is 2 Χ 186 X 27°, or 15,747,724, 
136,275,002,577 605,653, 961 ,181,555,468,044,717,914, 527,116,709, 366, 231 ,425, 
076,185,631,031,296. Show that these numbers are the same to three significant 
digits. 

*21. Discuss the approximation (1 — B?)7~!/? = 1+ 1/2B?. When is it 
good? 

#22, Suppose tan u = A/(s+ δ) If u is small and ὃ is small compared with 
s, justify tan u = h/s. 

*23. From 1/A = R(1/22 — 1/n?), R = 1.097 X 107’, n = 8, find dA. 

* 24. Suppose sin [(4 + a@)/2] = nsin (4/2). If A and a are small, justify 
a= (n— I)A. 


6-13] COMPLEX NUMBERS 369 


ANSWERS TO EXERCISES 


(a) (8-5-42). (0) More, since there would be more errors that might cancel. 
(c) a —b = a+ (—). Subtraction merely changes the sign of the error. 
(4) 138.20263; 138; 4, 134, 0, 138; to units. (6) 4.0, 134.2, 0.0; 138.2; 138.2. 
They agree to one decimal place. (ἢ 3.96; 134.21, 0.03; 138.20. They agree 
to two decimal places. (g) 138.23763. To one decimal place. (8) 137.9940. 
Agrees only to units. (i) 0.0504; 0.05. (7) If the decimals are rounded to 
one digit, we get an estimate of 3 X 10!, which does not agree with the com- 
plete result, 4.02 Χ 10!. Rounding off the decimals to two digits yields 4 x 101, 
as does the estimate. (k) 2Χ 1074; 1.65 X 10-*. (1) Two. (m) Three. 
(n) 3.95 X 10-1 X 4.86 X 9.13 K 1073 Χ 3.84 X 10° X 1.3. 

(o) (0.4)(5)(0.009)(4 Χ 103)(1). (p) 10. (q) None. (ἡ Four. (s) 0+ 
5+ 0-+ 3835 + 1. 


ANSWERS TO PROBLEMS 


1. “The complete rounded answer” means the result of following rules (2) 
and (4). Here one should round off to three significant digits before begin- 
ning. Use cube root tables. 3. 51% = 51/100. 5. Rounding off numbers to 
one digit yields 5 Χ 10!, compared with the complete answer of 4.068 Χ 10]. 
7. 4.1 Χ 10! and 4.07 Χ 101. 9. (1 +a)/1+y) = (( -᾿ “) {7 -Ἡ y)), and 
use Problem 8. Good when both x and y are small. 11. Multiply numerator 
and denominator by 1/50, then use approximations of Problems 9 and 10. 
13. 6X 10!2 mi. 15. When a is small; —a?; —a?/(1 + a)?. 17. 15. 19. 2, 10. 


κό-13 Complex numbers. In Section 4-3 we defined the addition of 
vectors, and the multiplication of vectors by real numbers in such a way 
that the geometric interpretation of these operations was the same as for 
real numbers. We now wish to define the multiplication of two plane 
vectors so as to satisfy the same conditions. When we do so we shall find 
that we have solved the mystery of imaginary and complex numbers! 


(a) Review the definitions of Section 4-3 and the discussion of definitions 
in Section 1-13. 


Our task now is to decide on an appropriate definition of multiplication 
of plane vectors (ordered pairs of reals). Of course, we are free to choose 
this definition as we wish, but, if possible, we desire that addition and 
multiplication of vectors follow the usual laws (such as the commutative, 
associative, and distributive laws) and that multiplication has the same 
vector interpretation as for real numbers. We defined the sum of two vec- 
tors as the vector obtained by adding their components. Suppose we de- 
fined the product of two vectors as the vector obtained by multiplying 
their components, that is, (a, b)(c, d) = (ac, bd)? It is easy to show that 
this gives a unique product which is also a vector and that the commuta- 
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tive, associative, and distributive laws are satisfied. Moreover, (0, 0) 
plays the role of 0, and (1, 1) plays the role of 1, so that (1-16-1) through 
(1-16-9) are satisfied with “vector” in place of “real number.” But 
(1-16-10) is not satisfied, since, for example, the vector (2, 0) is not (0, 0) 
and yet it has no reciprocal. 


(b) What do we mean by saying that (1, 1) plays the role of 1 with this defini- 
tion of multiplication? (0) Show that with this definition of multiplication 
~ dx dy (2,0)(2,y) = (1, 1). 


Since the situation seems more complicated than for addition, we may 
take another approach. Let us imagine that we have somehow defined 
multiplication so that the laws hold and see what conclusions we can draw. 
Then we have 


(1) (a, 6)(c, d) = [(a, 0) + (0, δ)} - [(ο, 0) + (0, d)] (4-3-2), 
(2) = [a(1, 0) + 0, I)]led, 0) + d0,1)] (4-8-7), 
(3) = ac(1, 0)(1, 0) + ad(1, 0)(0, 1) 

+ be(O, 1)(1, 0) + bd(O, 1)(0, 1) (1-12-7). 


Our probiem is reduced to deciding on how to multiply together the 
vectors (1,0) and (0,1). That is, we can solve our problem by deciding 
on definitions for (1,0)(1,0), (1, 0)(0, 1), (0,1)(1, 0), and (0, 1)(O, 1). 
Here the concept of isomorphism will be helpful. The vectors with second 
component zero are isomorphic to the real numbers with respect to addi- 
tion under the correspondence F(x) = (2,0), for we have F(a + δ) = 
(a + 6,0) = (a, 0) + (6,0) = F(a) + F(b). In other words, the vec- 
tors with second component zero behave exactly like the real numbers 
under addition. Moreover, F(0) = (0,0) and F(1) = (1,0). This is to 
be expected, since we began by interpreting real numbers as vectors on 
an axis. 


(d) Review the discussion of isomorphism in Section 6-8. 


It would be natural, then, to define multiplication of vectors so that the 
vectors (x,0) were isomorphic to the reals under multiplication also. 
This would require F(ab) = (ab, 0) = F(a)F(b) = (a, 0)(b, 0). In par- 
ticular, we would have (1, 0)(1, 0) = (1,0). From this isomorphism we 
see that (1, 0) plays the role for vectors that 1 does for real numbers, for 
(1, 0)(z, 0) = (2, 0), and there can be only one number with this property 
in any number system satisfying the axioms of Section 1-16. Hence we 
would expect (1, 0)(x, y) = (a, y) for any x and y. In particular we have 
(1, 0)(0, 1) = (0, 1) and, by commutativity, (0, 1)(1,0) = (0, 1). 


(6) How is Problem 30 in Section 2-11 related to the above? 
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(0, 1), (1, m/2)p 


FIGURE 6-3 FIGURE 6-4 


There remains only (0, 1)(0, 1) or (0, 1)”. How shall we define it? An 
appropriate decision is suggested by the following considerations. Accord- 
ing to (3-9-13), the absolute value (length of the corresponding vector) 
of the product of two real numbers is the product of their absolute values. 
Accordingly we might hope that the length of the product of two plane 
vectors would be the product of their lengths. This is indeed the case in 
the three products considered so far. For example, the lengths of (1, 0), 
(0, 1), and their product (0,1) are all 1. Hence we would expect the 
length of (0,1)? to be 1. Using polar coordinates, we have (0, 1)* = 
(1: 1/2)5 = (1,6). 

(ἢ Review polar coordinates in Section 4~10. 


Our problem now appears to be one of deciding on the relation between 
the direction angle of a vector and the direction angle of its square. The 
square (—1, 0)? is suggestive. Because of the isomorphism between vectors 
of the form (x, 0) and the corresponding real numbers x, we expect that 
(—1, 0)? = (1, 0), as sketched in Fig. 6-3. In polar coordinates we have 
(1,7)2 = (1, 27)». This suggests that the direction angle of a square is 
found by doubling the original direction angle. This conjecture (backed at 
present by only one example!) suggests (0, 1)? = (1, 1. ,2}2.ΞΞ Ot), = 
(—1,0). (See Fig. 6-4.) Replacing the products in (3) by the expressions 
found in the last few paragraphs, we find 


(4) (a, b){c, d) = ac(1,0) + ad(O, 1) + be(0, 1) + bd(—I, 0), 
(5) = (ac, 0) + (0, ad) +- (0, be) + (—bd, 0), 

(6) = (ac — bd, ad + be). 

This suggests the following definition. 

(7) Def. (a, b)(c, d) = (ας — bd, ad + be). 


(z) Use definition (7) to find (0, 1)(0, 1), (2, 1)(8, —2), (—1, 0)(5, —1), and 
(0, —1)?. 
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From (7) it is not difficult to prove that plane vectors satisfy all the 
axioms (1-16-1) through (1-16-10) with “plane vector,” “(0,0),” and 
“(1,0)” substituted for “real number,” “O,” and “1” throughout. The 
proof is based on the assumption that the reals satisfy the axioms. It 
follows that plane vectors (ordered pairs of reals) with these definitions 
of addition and multiplication satisfy all the laws of algebra derivable 
from these axioms, that is, all the laws except those related to order. In 
such a context it 1s customary to call plane vectors (ordered pairs of 
reals) complex numbers. 

The relation between the complex numbers and the reals is that there 
exists a subset of the complex numbers, namely those with second com- 
ponent 0, that is isomorphic to the real numbers with respect to the opera- 
tions of addition and multiplication, under the correspondence F(x) = 
(z,0). Thus F(1) = (1,0), F(—1) = (—1,0), F(O) = (0, 0), and so on. 
Because of this isomorphism, it 1s customary to write simply “1” for 
“(1,0),” “—1” for “(—1, 0),” and generally “x” for “(ὦ, 0).” 


(h) Why does this practice cause no difficulty even though, strictly speaking, 
1 and (1, 0) are not identical? 


From (7) it is now easy to prove the key conjecture we used in deciding 
on (7), namely 


But (—1, 0) corresponds to —1 under the above isomorphism. Hence 
this may be written 
(9) (0,1)? = —1. 

It is convenient to represent complex numbers as sums of their com- 
ponent vectors. According to (4-3-9), (a, 6) = a(1,0) + δ(0, 1). Using 


“1” in place of “(1,0)” and adopting the letter “c” as a name for “(0, 1),” 
we have | 


(10) Def. i = (0,1), 
(11) (a,b) — at bi. 


This expresses a complex number (ordered pair of reals) as the sum of a 
real number (a) times the unit real vector (1 or (1, 0)) and a real num- 
ber (Ὁ) times a unit vector ( or (0, 1)) called the «maginary unit vector. 


(i) Prove (12). 
(12) 17 = —1, 


(1) What is the other square root of —1? 
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In the complex number (a, b) we called a the real part and b the imaginary 
part. Note that the so-called imaginary part is as real as the real part! 
Complex numbers whose imaginary parts are zero are called real numbers, 
though actually they are merely isomorphic to the real numbers. They 
may be written in the form a -+ 02, (a, 0), or a. Complex numbers whose 
imaginary parts are not zero are called imaginaries. Those whose real 
parts are zero are called pure imaginaries. They may be written in the 
form 0 + δὲ, (0, δ), or δὲ. These terms have only historical significance, 
since an ordered pair of real numbers (a complex number, a point in the 
plane, a plane vector) is as real (possibly even twice as real!) as a single 
real number. 


(k) Plot the following points: 2, 27, 1, —i, 2-+ 4%, —1-+ 7. (1) Give four 
interpretations of “(2, 3),” indicating in each case the terms applied to 2 and 3. 


Since all the laws of algebra apply to complex numbers, we may operate 
with their names in the form a + 7b with the usual rules of algebra, keep- 
ing in mind, of course, the law (12). For example, using the usual manipu- 
lative rules we find (a +- bi)(e + dt) = ac + adi + bei + bd? = ace — 
bd + (ad + bc)i, as we would expect according to (7). 


Carry out the following operations, plotting the complex numbers involved. 
Repeat with numbers written in the forma-+ bi. (m) (2,0)(8, 4), (ὦ) (1, 1) 
(1, Ξ 1). (0) (3, 4) (2, 1). 


We can now prove our conjecture that the direction angle of a square 15 
twice the original direction angle. Let (x, y) = (7, 9)». Then by (4-11-2), 
x = rcos@and y = rsin 6. Hence 


(13) [(a, y) = (7, pl © [x + yi = r(cos 6 + isin θ)]. 


The right member of the second equation is said to be the polar form of 
writing complex numbers, while its left member is said to be in rectangular 
form. Using the polar form, we have 


(14) (r, θ})2 = (r(cos 9 + 1 sin @))? 

(15) = r7(cos?@ — sin?@ + 72 sin 6 cos 6) 
(16) = r*(cos 20 + isin 26) 

(17) = (r?, 20). 


We see from this that the length of the square is the square of the length 
and the angle of the square is twice the angle. 


(p) Explain the transition from (14) to (15), (q) from (15) to (16). (r) What 
is the reason for (17)? 
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The above result can be generalized in two ways. First we may gen- 
eralize by taking the product of two different complex numbers and prove 
that | 


(18) (7161) p(r2, θ2)» = (Tir2, 01 + 62)p. 
In words, the length of the product is the product of the lengths and the 
angle of the product is the sum of the angles. 


(8) Prove (18) by continuing the following: (71, @1)(re, 02) = (ri(cos 6; + 
2 sin 61)(re(cos θα -+ isin @2)) = °°: 


Second, we may generalize the exponent by writing 
(19) 8 5 ΞΞ δ}: 


This is known as DeMoivre’s theorem and can be proved to hold for all 
numerical values of n, including complex exponents when these have been 
defined properly. The proof of (19) for n a natural number 1s by induction. 
The verification is immediate. For the induction we have 


(20) (7, 057) = (1, O)p(r, 8)} 


(21) a ΘΟ δ (Induction assumption) 
(22) = (r™*1 6+ n6)y (18) 
(23) = (r®t1 (n + 1)6)>. 


But the last equation is just (19)(n:n + 1). Hence the induction is com- 
plete and the theorem is proved. 


(t) Does it matter that we have used the variable n in the induction instead 
of x as in Section 6-5? (u) Write out (17), (18), (19) with the complex num- 
bers in polar form. (v) Complete the following: The natural numbers are 


closed under addition and multiplication but not πάθος. ς᾽’ and : 
The integers are closed under addition, subtraction, and —________, but. not 
closed under ___--. The rationals are closed under —___-. but. not 
πᾶ. ,. The reals are closed under —__..___. but they are not closed 
under _____ since 

PROBLEMS 


In Problems 1 through 10 carry through the operations by using the defini- 
tions. Then rewrite in rectangular form and do the operations by the usual 
algebraic rules and compare your results. Sketch the numbers involved and 
verify (18). 
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1. (2, —1)(3, 0). 2. (0, —~3)(1, 2). 3. (3, 4)? 
4, (1,1)(1, —1). 5. (—1, 2)2. 6. (1, —2)2 
7. (0, 1)(0, —1). 8. (1, 193. 9. (—0.5, 0.5+/3)3. 


10. (a, b)(a, —6). 


11. With definition (24) prove that 1/(a, 6) is the reciprocal of (a, δ) called 
for by (1-16-10). 


(24) Def. 1/(a, δ) = (a, —b)/(a? -+ 07). 


12. Find 1/(a, δ) by multiplying numerator and denominator of 1/(a -+ δ) 
by a — δὶ. 

13. Show that the length of the quotient is the quotient of the lengths and 
the angle of the quotient is the difference of the angles. State and prove a rule 
for reciprocals. 

#14. As for real numbers, we call any complex number z such that 25 = w an 
nth root of w. The root with smallest non-negative direction angle is called the 
principal nth root. For example, 12 = —1 and (—i)? = (—1)?7? = —1, so 
that 7 and —7 are square roots of -—-1, but 2 is the principal square root. Sketch 
the two square roots of 1 and the four fourth roots of 1. 

* 15. Show that 1, —0.5 + 0.531, and —0.5 — 0.537 are cube roots of 1. 
* 16. Find the cube roots of —1. 

*%17. Show that when x > 0, ἦν and —iv/z are the square roots of —z. 

* 18. We now show that any complex number (except 0) has just n distinct nth 
roots. Suppose we wish (8, @), to be an nth root of (7, #)p. We have (8, φ)"} = 
(s", nb) = (r, 8), where we omit the p since we shall be using only polar co- 
ordinates. From this we have s* = 7, where 8 and r are real and positive since 
they are lengths and we are excluding the null vector. Hence s = r!/, the prin- 
cipal nth root of r. By Problem 12 in Section 4-10 we have n@ = @-+ 2ζπ, 
where k is some integer, or Φ = 0/n-+ 2kw/n. Hence (7), 6/n + 2kr/n)p is 
an nth root of (r, 6), for any such k. All these roots have the same length, and 
therefore the corresponding points lie on a circle of radius r!/" and center at the 
origin. If @ is the smallest non-negative direction angle of (r, 0)», then the 
- principal root is (r!, 6/n). The others are then obtained by going counter- 
clockwise around the circle through successive angles of 27/n. Taking k = 
0,1,2,...,n— 1, we get all possible distinct points, since beginning with 
k = ἢ, we reach the same points as before. Hence 


(5, d)p = (1, 8)» 


(25) | lin 

OAkKkKeETJ A (s,¢) = (r , O/n4+ 2kr/n). 

Verify that the formula gives the three cube roots of 1 found in Problem 15 
and the four fourth roots of 1. 

*19. Find the five fifth roots of 1 and sketch them. (Suggestion: Express them 
in polar form.) 

* 20. Find the sixth roots of 1 by (25) and also by factoring z®© — 1 = 0. 
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*21. The complex conjugate of a + δὲ is defined as a — bi. The complex conju- 
gate of the complex number z is written 2. What is the geometric relation be- 
tween a number and its conjugate? When does z = 
*%22. Show that z-+ w = 2+ 30,27 —w = Z— τὸ, zw = 2B, (2/w) = 2/B, 
on ἘΞ 2" and. 32 = |2/7, 

* 23. By expanding (cos @-+ isin 6)? and using DeMoivre’s theorem, find 
formulas for cos 3@ and sin 3@. 

% 24. Under what operations are the complex numbers closed? 

*25. The closure of the complex numbers is even broader than is suggested 
by the answer to Problem 24. Indeed, any algebraic equation whose coefficients 
are complex numbers (one of the form a,2”-++ a,—14"~ 1+ ++++ α12 -Ά ao = 0, 
where a, * Ὁ and where the a’s are complex) has at least one root that is a com- 
plex number. (It may have as many as n distinct roots.) Study the proof of 
this result, known as the Fundamental Theorem of Algebra, given in What Is 
Mathematics?, by R. Courant and H. Robbins, Chapter II, Section 5. 


2) zg = —2Z? 


ANSWERS TO EXERCISES 


(b) (x, y)(1,1) = (α, ψ) is a law. (ὁ) With this definition, (2, 0)(z,y) = 
(2x, Oy) = (25, 0) ¥ (1,1) no matter what values of x and y we choose. 
(6) Problem 30 in Section 2-11 states that there is one and only one number 
with the property (1-16-8). Hence if we wish the complex numbers to satisfy 
the same laws as the reals, this must hold also. (6) (—1, 9), (8, —1), (—5, 1), 
(-- 1,0). (h) We are using “x” as an abbreviation for “(z,0),” but we 
will never get anything contradicting what we would expect for “x” repre- 
senting a single real number, because the isomorphism means that (2, 0) and 
x have identical properties with respect to the operations of algebra. (1) Find 

(1) The point with coordinates 2 and 3; the vector with components 2 and 3; 
the complex number with real and imaginary parts 2 and 3; the ordered pair 
with first member 2 and second member 8. (m) (2:3 —0-4,2-4-+ 0-3) = 
(6, 8); 2(3 + 47) = 6-4 81. (n) (2, 0), 2. (o) (2,11), 2 - 11}. 
(p) (1-12-9) and ἐ2 = —1. (4) (4-12-22), (4-12-23). (Ὁ) (13) (r:r?, 8:28). 
(s) Multiply, collect terms involving 7 together, and use (4-12-9) and (4-12-10). 
(t) No. It is a dummy in the induction. See Sections 2-6 and 2-11. (u) De- 
Moivre’s theorem becomes (r cos 6 + ir sin 6)” = r”(cos ηθ + isin né). 


ANSWERS TO PROBLEMS 


11. Multiply by (a, 6) and show that the product is (1,0). 13. Use (24), 
(4-12-17), and (4-12-18). 


CHAPTER 7 
CALCULUS 


7-1 What is calculus? Calculus had its origins in a number of problems 
that have troubled scientists and philosophers since ancient times. At 
first sight these problems appear to be quite different, but it turns out 
that they have a common character and can be attacked by very similar 
methods. 

First, consider the problem of finding the area of an arbitrarily given 
region in a plane. This is easy if the region happens to be bounded by a 
rectangle, parallelogram, triangle, trapezoid, or polygon. Indeed, by using 
a formula for the area of a triangle we can easily find the area of any 
region that is the union of triangular regions. 


(a) How does one find the area of a polygon? 


But how can we find the areas of regions that cannot be divided into 
triangular regions? What about regions bounded by circular arcs, ellipses, 
sine curves, and so on? In very ancient times, mathematicians already 
took the correct path to the solution of this problem. They approximated 
areas within curves by replacing the curves with numerous straight line 
segments joining points on the curves. For example, they approximated 
the area of a circle by inscribing regular polygons whose areas could be 
calculated. As is evident in Fig. 7-1, the larger the number of sides, the 
closer the approximation. None of these approximations is the exact area, 
but it appears reasonable to say that as the number of sides increases, the 
area of the polygons approaches the exact area of the circle. We may 
think of the area within a closed curve as a number that is approximated 
arbitrarily closely by the area of inscribed polygons. 


FIGURE 7-1 
377 
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B 


FIGURE 7-2 


A somewhat similar problem is that of length. We know how to find the 
length of a straight line segment or of a broken line (curve made up of con- 
nected straight line segments). But what of an are on a circle, ellipse, 
hyperbola, or other curve not made of segments? It seems reasonable 
here to approximate the curve by a broken line and consider the total 
length of such a broken line as an approximation to the length of the curve. 
As suggested in Fig. 7-2, by choosing the endpoints of the segments close 
enough together on the curve, we may make the broken line arbitrarily 
close to the curve. 

A third problem, which appears to be quite different from the first two, 
is that of finding the exact velocity of a moving body. Of course this is 
easy if the body is known to be moving at a fixed velocity, for we have 
velocity = distance/elapsed time. But this relation does not hold when 
velocity is changing. Imagine a body moving along a straight line, and 
let y be its directed distance from a fixed point on this line. Let x be the 
corresponding time. We assume a function f such that y = f(x). Suppose 
at time x, the body is at y; and at time x2 the body is at yg. In an elapsed 
time of x2 — 2, the body has moved a directed distance of yg — ψι. If 
we know that it moves at fixed velocity, this velocity is evidently 
(yo — y1)/(€2 — 11) or (f(X2) — f(r1))/(@2 — 21). But if the velocity 
is changing, the most we can say is that this ratio is the average velocity 
during the time interval (x, x2). What do we mean by the exact velocity 
at the time x,? It seems natural to say that we could get a very close 
approximation to it by choosing t2 very close to x;. We may define the 
exact velocity (or instantaneous velocity) as the limit of the average velocity 
as the time interval is shortened toward zero. 


(b) Why not let z2 = 21? (ὁ) What is the interpretation if z2 > 21 
and γὼ < yi? 


To visualize the problem of velocity graphically, we may graph the 
function f relating time to distance y as in Fig. 7-3. The average velocity 
(ψ2 — Y1)/(t2 — 11} is just the slope of the line joining the two points 
on the graph of f. Thus the average velocity appears as the slope of a secant 
line. As we let the point (2, y2) move closer to (x1, y;), the secant line 
swings around and approaches the tangent line 7. Its slope, then, must 
approach the slope of the tangent line, which is the geometric counterpart 
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(Ὁ, yd) (XQ, Ya) 


(ii; ¥1) 


FIGURE 7-3 


of the exact velocity. Hence finding the exact velocity is equivalent to 
finding the slope of a tangent line. 

We can generalize the above. Let us imagine a function f given by 
y = f(x). Jf x changes from 2, to 2g, it has an increment of to — 24. 
At the same time, y changes from f(x) to f(v2) by an increment f(x2) — 
f(z 1). The ratio of the change in y to the change in z is (f(r2) — f(x1))/ 
(2 — x1), which we call the average rate of change of y (or f(a)) with 
respect to x in the interval (x; 12). If we hold 2; fixed and let ry change so 
that it approaches σι, the average rate of change may approach a number. 
If it does so, we call this number the exact rate of change of y with respect to 
x and x,. Geometrically it is the slope of the tangent line to the graph of f 
at the point (x1, y1). 


(4) To determine the tangent line at a given point on a curve is it sufficient 
to know the slope? 


As a final example of problems giving rise to calculus, we mention one of 
Zeno’s famous paradoxes, that of the runner in a race. Some twenty-five 
centuries ago Zeno argued that a runner can never reach the finish line. 
For, Zeno reasoned, the runner must first go half way. Then he must go 
half of the remaining distance. Having arrived at the three-quarter mark, 
he must go half of the rest, and so on. In this way he must cover an 
infinite number of smaller and smaller distances without ever arriving 
at the finish. The situation is illustrated in Fig. 7-4, where we take the 
total distance to be unity. 

Let y; = the length of ith interval covered by the runner. Then y; = 
1/2, yo = 1/4, ys = 1/8, ys = 1/16, and for any 7, y = 2. The y; 


ΒΕ ΞΕΕ 
0 1 ϑ i la J 
2 4 8 16 


FIGURE 7-4 
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are terms of a geometric progression with common ratio 2. Lets, = the 
sum of the first πὶ terms. Then 8, = 1 — 2~”. Let us assume that the 
runner moves with a constant velocity equal to 1. Then let ¢; = the time 
required to cover the 7th interval. It is easy to see that ἐξ = y; so that if 
we let Τ᾿, = the time required to cover the first n intervals, we have 
T, = 1— 2. Evidently there is here no question of the runner re- 
quiring an infinite time to reach his goal, for the Τὶ are bounded from 
above by 1. 


(e) Justify these results by reference to Section 6-7. 


But there is a real difficulty here. Zeno has divided the segment of unit 
length into an infinite number of subdivisions. How can an infinite number 
of segments add up to a finite segment? No one of the s, is equal to 1. 
Does it make sense to speak of the sum of all the y;? Does 1/2 + 1/4 + 
1/8 + 1/16 + 1/32 +... ad infinitum = 1? If we can answer these 
two questions in the affirmative, we shall have a complete answer to Zeno. 
We can say to him that even when the unit interval is sliced into an in- 
finite number of subintervals, their sum is still unity. If he wishes to think 
of the runner passing over an infinite number of intervals of decreasing 
length, the time required for each interval decreases also, and the sum of 
all the times is still one. 

To achieve these-results we have to define what is meant by adding to- 
gether an infinite number of numbers. To do so we use the idea of ap- 
proximation. The sum of the first n times is given by T, = ye i = 
1— 2°”. Asn gets larger this clearly gets closer and closer to 1. We then 
define the sum of all the ἐς, that is, of all the terms of the sequence, as 1. 
More generally, we define the sum of all the terms y; of any sequence as 
the limit (Gf any) approached by 8, (the sum of the first n terms) as ἢ 
gets large. | 


(f) Why does Τί, approach 1 as n gets large? (g) Study the infinite geo- 
metric series 1/3 + 1/9 + 1/27 +.... Find a formula for the sum of the 
first n terms. What would you guess as the sum of all the terms? 


The reader must have observed that these problems, though different 
in origin, have several things in common. In each case we have a concept 
that is easy to handle in simple cases (area of rectilinear figures, length of 
a segment, constant velocity, secant lines, average rate of change, sum of 
a finite number of terms of a sequence), but which does not yield to the 
same methods when we try to generalize (area of a region, length of a 
curve, changing velocity, tangent lines, exact rate of change, sum of an 
infinite number of terms). The difficulty appears to lie in our seeming 
inability to do more than approximate what we seek. Indeed, in each 
case it is not even clear that we know what we are looking for until we 
can formulate a clear definition (of area, length, rate of change, sum) that 
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will apply to the general case. The way out appears to lie in making use 
of the idea that a number may be defined in some way as a limit of 
approximations. 

The characteristic feature of calculus is the use of the concepts of func- 
tion and limit to attack problems along the lines suggested here. The 
purposes of this chapter are (1) to give the student an intuitive apprecia- 
tion of the fundamental ideas of calculus, (2) to suggest how a rigorous 
theory could be constructed, (3) to familiarize the student with the tech- 
niques and applications of calculus in simple situations. 


ANSWERS TO EXERCISES 


(a) It can always be done by adding the area of triangles. (Ὁ) The ratio 
is then undefined. (6) y is decreasing, the velocity is negative. (d) See 
Section 4-5, Exercise (ᾳ). (e) (6-7-25) (f) 1/2” clearly can be made arbi- 
trarily small by taking n large enough. (g) Sum is 1/2. 


7-2 Rates of change. We begin by considering the problem of exact 
rate of change and the related problem of tangent lines. We have already 
treated (Section 4-5) the very simplest case in which z and y are related 
by a linear function mJ + bso that y = mx-+b. There we found that the 
change in y divided by the change in x in the interval (v1 22), (yo — y1)/ 
(x2 — 14}, is m, the slope of the straight line graph of mJ + ὃ. This 
ratio is the average rate of change of y with respect to x in the interval. 
It is also the exact rate of change, since the ratio is the same for any choice 
of the two points. 


(a) Suppose a body moves with constant speed m along a path in which its 
distance from a fixed point along the path is y and the corresponding time is 2. 
What function relates time to distance if its distance from the fixed point is b 
when x = 0? (Ὀ) Review Section 4-5. (c) What is the significance in 
terms of rate of change and slope of m < 0,m = 0,m > 0? 


mi + ὃ 
(0, b) 


Mileage 


Figure 7-5 
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Figure 7—5 illustrates the discussion of Exercise (a). We imagine an 
automobile moving along a road with its position given by y and the time 
by x. Distances are measured from the point A. The car is at point Δ, 
a distance of b from A, when x = 0. Point C suggests an unspecified time 
x and the corresponding distance y. On the speedometer the arrow points 
to the speed m and the dial shows the distance y (assuming the mileage was 
set to zero at A). The graph shows the function relating x and y. Here 
m appears as a slope. 

(d) At what time is the car at A, assuming always a maintained speed of m? 


The idea that a car can maintain a fixed speed for more than a very short 
time is certainly not very realistic. Suppose, then, that our car moves at 
various speeds during different time intervals but maintains In each interval 
a uniform speed. Let the car start at the point B at time x = 0. Assume 
that it goes 60 mi/hr for 4 hr, then 20 mi/hr for 12 min, stands still for 
24 min, and finally goes at 40 mi/hr for 54 min. Measuring time z in hours 
and distance y in miles, we see that when x = 0.5, y = 380, when x = 
0.5 + 0.2 = 0.7, y = 30+ 20(0.2) = 30+ 4 = 34, whenz = 0.7 - 0.4 = 
11, y= 34+0= 34, and when ὦ. Ξξξῷ 1.1 - 0.99 -τΞῷ 2, y= 344+ 
40(0.9) = 70. These points are sketched in Fig. 7-6. Since the car is 
traveling at a constant speed in these intervals, its position is given by the 
piecewise linear function joining these points. Moreover, the slope of each 
segment is the speed during that interval. For example, in the time in- 
terval (0 30) the speed is 60. The function f relating time and distance 
is graphed as a solid line. The function Df, giving the slope of the graph of 
f (or the rate of change, or the velocity), is graphed with dashes. 


¥y 


(0.7, 34) 


(1.1, 34) 
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FIGuRE 7-6 
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(6) What is the slope of f in the interval (0.5 0.7)? How is this reflected 
in the graph of Df? (ὃ Why does the graph of Df lie along the z-axis in the 
interval (0.7.1.1)? (6) What is the speed at the time x = 0.5? (8) In 
Fig. 7-6 what is Df(0.6)? Df(1.5)? Df(1.1)? 

The problem just discussed is not very realistic because of the situation 
at the corner points of the piecewise linear function. What is the speed at 
the time « = 1.1? What does the speedometer show at this instant? 
Actually, if a car could really change instantaneously from 0 to 40 mi/hr, 
the speedometer would be shifting znstantaneously from 0 to 40 and would 
have no defined position. In terms of the graph, there is no defined slope 
at (1.1, 34), since at any point (however close) toward the left of (1.1, 34) 
the rate of change (slope) is 0, but at any point toward the right it is 40. 
We see then that the domain of Df does not contain the values of x at the 
corner points 0.5, 0.7, 1.1. We say that the rate of change is not defined 
there. 


Let f be the piecewise linear function with corner points (0,0), (0.5, 25), 
(0.8, 31), (1.2, 31) and (2.2, 80). (i) Graph it. (j) On the same axes, graph 
Df, the function giving the slope of the graph of f. (k) Interpret your results 
in terms of an automobile as we did. (1) What are the domain and range of f? 
(m) of Df? (n) What is Df(1)? 

Let us now try to be still more realistic in describing an automobile 
whose movements are approximated in Fig. 7-6. Even the most extreme 
hot-rod driver cannot “jack rabbit” his car from a sfinding position to 40 
mi/hr instantaneously, as Fig. 7-6 indicates at the point (1.1, 34). It 
actually takes some time for a car to change its speed. Figure 7-7 sug- 
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gests what actually happens. For any value of x, F(x) represents the total 
distance traveled and Df(zx) represents the speed at the time x (the reading 
on the speedometer). When x = 0, both distance and speed are zero and 
the graphs of f and Df are together. The speed, given by the dotted line, 
gradually rises and then falls during the interval (0 0.5), so that its 
average is 60. Note that where the graph of F is steepest, the graph of DF’ 
has its highest point. This is the time when the speed is maximum. As 
the driver slows down before the second interval, his speedometer reading 
falls, so that during the second time interval it averages 20. When the 
car comes to a stop, the graph of DF reaches the x-axis where DF(x) = 0 
and stays there until the driver again accelerates so as to achieve an average 
speed of 40 in the last interval. We have assumed that he is again slowing 
down toward the time x = 2. 


(0) Why is the graph of F steepest where the speed is greatest? (p) How 
does the graph show the driver slowing down toward the end? 


Figure 7-7 is a fairly realistic picture of a possible situation. For each 
value of x, DF (x) is the exact speed as given by the speedometer reading. 
There are no points where DF (2) fails to exist. To see more clearly the re- 
lation between F and DF in Fig. 7-7, we approximate the smooth curve F 
by a piecewise linear function g consisting of small segments. This is sug- 
gested in Fig. 7-8, where we make the approximating segments rather 
long, since short one can hardly be distinguished from the smooth curve. 
Then Dg is the step function shown in the figure. In any interval where 
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the graph of g is a segment, Dg(x) is the slope of that segment. Now let us 
imagine the number of segments to be very large so that g differs from F 
very little. Then Dg will consist of many little steps approximating DF’, as 
suggested for a part of Dg in Fig. 7-9. The segments making up g prac- 
tically coincide with the curve F, and their slopes are practically the same 
as the slopes of the tangent lines to F in each interval, as suggested in 
Fig. 7-10. Hence it seems reasonable to say that DF (4) is the rate of change 
of F(x) at the point (x, F(x)), that is, the slope of the line tangent to F at 
the point (x, F(x)). 


(q) Trace Fig. 7-7 on a piece of paper. Join the points on the curve for 
which xz = 0, 0.2, 0.4, 0.6, 0.8, 1.0, 1.2, 1.4, 1.6, 1.8, 2.0 by straight line seg- 
ments, and call this graph G. Estimate the slopes of each segment by estimat- 
ing the coordinates of the endpoints. In each interval draw the graph of DG 
such that DG(r) = the slope of the segment in the interval in which z lies. 
Compare with Figs. 7-6 and 7-8. Why is DG(x) undefined at endpoints of 
segments? | 

The procedure we used for finding DF approximately when F is given 
is often useful. From the graph of F we can estimate DF by observing that 
DF (x) is approximately the slope of a short segment Joining two points on 
F near (x, F(x)). However, it is desirable to have more accurate methods 
of finding rates of change. For this purpose we introduce precise defini- 
tions of “rate of change” and “tangent” in the following sections. Here we 
consider a particular simple case in order to familiarize the reader with 
our intentions. 

Consider the function defined by y = x”. Figure 7-11 shows a very 
much enlarged portion of its graph in the neighborhood of (3,9). We 
label the tangent line at (3,9) Τ. The secant lines joining (3, 9) to three 
other nearby points are labeled S,, S2, and Ss. By (4-5-11) with (#7, y;) = 
(3, 9), we find that the slopes of S1, Se, and S3 are 7, 6.5, and 6.1. Follow- 
ing is a table showing the slope of secants for points still closer to (8, 9). 
The first column gives the z-coordinate of the second point on the secant, 
the second column the corresponding value of y, the last three columns the 
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Figure 7-11 


differences of the coordinates and the slope. Slopes of secants determined 
by letting the second point lie at the left are included also. The table shows 
that as we let the second point approach the first, the slope of the secant 
appears to approach the value 6. 


22 — © (y2 — y1)/(t2 — 21) 


0.0601 
9.006001 0.006001 
9.00060001 0.00060001 0.0001 


6.25 —2.75 —0.5 
8.41 —0.59 1 ΞΞΘῸῚ 
: 8.9401 —0.0599 —0.01 
.999 | 8.994001 —0.005999 —0.001 
.9999 | 8.99940001 | —0.00059999 | —0.0001 


It appears that if we take a sequence of secant lines, Si, Se, S3,... 
passing through (3, 9) and another point that moves toward (3, 9), the 
slope of the secants approaches 6. Since the sequence of secants approaches 
coincidence with the tangent line at (3, 9), we suppose that the slope of 
this tangent line is 6. In other words, if y = x”, the instantaneous rate 
of change of y with respect to x at (3, 9) is 6. 


(r) Consider the graph of y = 2? at 2 = 2. Find the slopes of the secant 
lines joining (2, 4) to other points as we did above. Make a table for x2 = 3, 
2.5, 2.1, 2.01, 2.001, 1.5, 1.9, and 1.99. What is the slope of the tangent line? 
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This sort of procedure could be used whenever we know a defining for- 
mula for a function. We could calculate slopes of secants for a sequence of 
values of the variable getting closer to the value in which we are interested, 
then guess the limit. However, by using algebra instead of arithmetic, 
we can get more precise and general results. 

Consider f defined by y = x”. Let (x, y) be a point at which we wish 
the instantaneous rate of change of y with respect to x, that is, Df(x). 
Let (x + ἃ, (x + h)”) be another point on f, where ἢ may be > 0 or < 0. 
Here x} = ὦ, 22 = x +h, and x2 — x; = ἢ. The slope of the secant 
is given by 

ΟΞ (@+th?—2? — thea +h _ 


It is obvious that as A approaches zero (i.e., the secant approaches the 
tangent), m approaches 2x. In brief, as ἢ — 0, m - 2z, where we read 
the arrow as “approaches” or “goes to.” For example, when z = 3, we 
have m — 2" ὃ or 6, as we found above.* 


(s) Verify that (1) holds in the table above. (t) Why not let h = 0 in (1)? 


For any value of x the formula 2z gives the slope of the line tangent to 
the graph of y = x” at the point (xz, x”). For example, when x = —1, the 
slope is 2(—1) or —2. Hence the line tangent at the point (—1, 1) to the 
graph of I? isy — 1 = —2(x+ 1). 

(u) Sketch J? and tangent lines at x = 0,2 = 8,2 = —1. Write the equa- 
tions of the tangent lines there. Do their slopes seem consistent with your 
drawing? 


The formula 2z, giving the slopes of tangent lines to the graph of J’, 
is a defining formula for the function 21. We call 21 the derivative of 13. 
More generally, a function under which the image of z is the slope of the 
line tangent to the graph of f at the point (2, f(x)) is called the derivative 
of f and is designated by Df. Df(x) is the slope of the tangent line to the 
graph of f at (x, f(x)) and also the exact rate of change of f(x) with respect 
to x. We may summarize our discussion in the law 


(2) DI? = 21. 


(v) Graph J? and 27 on the same plane. Explain the relationship between 
the two graphs in terms of rate of change and tangents. (w) Suppose that 


* Although this symbol is the same as that used for implication in this book, 
confusion is unlikely because of the different contexts in which the symbols are 
used. In particular, the implication arrow stands only between sentences. In 
order to avoid the double use of symbols, mathematicians often write longer 
arrows or double-shafted arrows for implication. 
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Df(x) > 0 for a certain zx. What conclusion could you draw? (x) Answer the 
same question for Df(x) < 0, and for Df(x) = 0. (y) Use the above methods 
to find (3122. (2) Justify 


(3) Dial?) = 2al. 


PROBLEMS 


In Problems 1 through 10 find the slope of the line tangent to the graph of 
the indicated function at the point with given x-coordinate, write the equation 
of the tangent line, and sketch. 


l.y = χδεαῦχ = —2. 2.47 = χα’ atv = 2. 
3. 12 atx = 0.5 4, 212 atx = —1. 
5. --312 atz = —1. 6. atx = 0. 
7. 13 ͵ = —1. 8. Pata = 1. 
9. 2135 atx = zo. 10. —I% aty = 8. 
11. Justify (4). 

(4) D(aI3) = 3al?. 


12. Justify (5) and (6) in terms of rate of change. 


(5) D(Z) 


1. 
(6) D(c) = 0. 


13. A freely falling body falls a distance y given by y = 16x?, where z is 
time measured in seconds from the time when the body was at rest and y is in 
feet. Find a formula giving the velocity of the body at any time. 

14. Recalling that acceleration is the rate of change of velocity with respect 
to time, find a formula for the acceleration of the body in Problem 13. 

15. Let x be the output of a factory and y the total cost of the product, and 
imagine Fig. 7-6 as the graph of the function y = f(x). What is the interpreta- 
tion of Df? Do you think that a cost function might have this appearance? 
Would Fig. 7-7 be more realistic? 

16. Let x be the time in years and y be the population in millions, and 
imagine Fig. 7~6 as the graph of the function y = f(x). Answer the questions 
of Problem 15. 

17. Let x be time spent in memorizing and y be the amount of material 
memorized, and imagine Fig. 7-6 to represent the graph of y = f(z). Answer 
the questions of Problem 15. 

18. V. O. Key (Primer of Statistics for Political Scientists, pp. 74-81) finds 
that the relation between the New York State presidential vote Y and the 
New York City registration X in millions was given approximately by Y = 
976,710 + 1.6026X during the years 1924-1948. Indicate what this means 
in terms of slope and rate of change. 
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19. In each of the following cases discuss the rate of change of y with respect 
to 7. 


y x 
(a) Distance Time 
(b) Wealth Time 
(c) Total cost Amount produced 
(d) Income from sales Amount sold 
(e) Amount of gasoline in a tank Time 
(f) Words memorized Time 
(g) Height of person | Time 
(h) Circumference of circle Diameter 
(i) Value of a(b + c) b+ ec 
* 20. Suppose y is the distance of a car from New York City and y = —35x + 


550. What is going on? Sketch the function. 


In Problems 21 through 23 we assume that the variables are known to be 
related by linear functions. 


*21. Suppose m = 2. List several possible functions with this property and 
sketch them on the same graph. 

*% 22. Suppose that m = 2 and y = 3 when xz = 1. Find the function and 
graph it. 

* 23. Suppose the interest rate is 10% and that the amount of money on 
deposit after 5 years is $150. Find the function and determine the amount 
on deposit after 6 years and also at the beginning. 


ANSWERS TO EXERCISES 


(a) mI +b. (ὁ y decreasing; not changing; increasing as 2x increases. 
(4) —b/m. (e) 20 mi/hr. (f) The slope of fis zero there. (g) Undefined. 
(h) 20 mi/hr; 40 mi/hr; undefined. (1) (0 2.2), (0 80). (m) (0_0.5) U 
(0.5.0.8) U (088 1.2). U (1.2__2.2), {50, 20,0,49}. (mn) 0. (0) The steeper 
the graph, the faster y is increasing. (p) DF turns downward, 1.6., decreases. 
(q) No unique rate of change there. ( 4. (0) m is undefined there; 
(Qha + h?)/h = 22 +h only for h #0. (u) y = 0; y—9 = 6(@ — 8); 
y—1 = —2(x+ 1). (v) Consider the intervals where 2x < 0, = 0, > 0. 
(w) f(x) is increasing at that point. (x) f(z) is decreasing; stationary. 
(y) 62. (2) [a(a + h)? — ax?]/h = 2ax + h — 2az. 


ANSWERS TO PROBLEMS 


l.y—4 = --4( -ἰ 2). 8. y — 0.25 = (Σ — 0.5). δ. y-+ 3 = βίῳ -Γ 1). 
7. DI3 = 312; γ-ΕἘ1Ὶ = 3(z+1). 9. y — 21:8 = θχξία — 1:0). 11. a(e +h)? — 
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ax? = 3ax*h + 3axh? + h3. 18. DIGI? = 327; v = 32x. 19. (a) Velocity, 
(b) Net rate of income, (c) Marginal cost, rate of change of cost per unit change 
in output, (d) Price, (e) Net flow of gasoline into tank, negative when driving, 
positive when filling tank, (f) Rate of memorization, (g) Rate of growth, 
(h) a, (i) a. 


7-3 Limits and continuity. From the previous section it appears that 
in order to define precisely the notions of “exact rate of change” and 
“slope of the tangent line” we need a definition of the notions of “approach” 
and “limit.” We wish to define “f(z) — Las xz — a” (f of x approaches L 
as x approaches a) and the synonymous expression “lim, ,, f(z) = L” 
(the limit of f(x) as x approaches a is L). Then we can derive rules for 
finding limits with precision and certainty. We introduce the definition 
by considering typical examples. 

To begin with a very simple case, consider 27 defined by f(x) = 2z. 
We have f(a) = 2a. Also it seems plausible that f(z) = 2. — 2a as 
x —a. Since |f(x) — 2a] = [22 — 2a] = 2|x — al, f(x) differs from 2a 
by an amount that becomes smaller as |z — a| becomes smaller. Indeed by 
taking x close enough to a we can make |f(x) — 2a| as small as we like. 
Suppose, for example, that we wish to make |f(x) — 2a] < ε, where e 
is any positive number. It is sufficient for this purpose to make |x — 2] < 
€/2. Indeed for any x such that |x — αἰ < €/2,|f(z) — 2a| < 2(€/2) = «. 
In particular f(z) - 0 85 2 — 0. 

Now consider 21 with the point (0,0) deleted (see Fig. 7-12). Here 
f(x) = 2x for x ¥ 0, and f(0) is not defined. Nevertheless it seems rea- 
sonable to say that f(z) — 0 as x -- 0, since |f(z) — 0] = [25 — 0] = 
21], and this is less than any positive ε, however small, provided 0 < [1] < 
€/2. Note that we had to exclude x = 0, since f(x) is undefined there. 

Now consider 27 with (0,0) deleted and (0, 2) added, that is, f = 
(21 ἡ {(0, 0)}’) ὦ {(0, 2)}. Here f(0) = 2, yet we feel that f(x) -- 0 
as x — 0, since [f(x)| is arbitrarily small provided [1] is small and not zero. 


Figure 7-12 
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᾿ II sin (1 7,0) 
« 


(1, 0) 


FIGURE 7-13 


We see from these examples that the limit of f(z) asx — a does not depend 
on f(a), which may be undefined or different from the limit. Hence our 
definition should not put any conditions on the function at x = a. 


(a) In this last example, what is |f(z) — 0| when = 0? 


To see the necessity of requiring that we can make |f(x) — L| sete 
small by taking |x — αἱ suffictently small, consider the function 1 24 197~1° 
As x — 0, x? + 1010 always gets closer to 0 and the difference Beenie: 
very small. But lim,o(z? + 107!°) 0, for 2? + 10. 5 cannot be 
made closer to 0 than 107 7°. 

Now consider the function sin [7/J] defined by f(x) = sin (7/z) and 
sketched in Fig. 7-13. For x € (( οὐ), w/x decreases from 7 toward zero, 
and sin (7/x) runs from 0 to 1 and back toward zero. However, as x goes 
from 1 toward 0, πα increases from 7 without bound. When x = 1/n for 
n integral, sin (7r/x) = sinna = 0. When x = 2/(4n + 1) with n in- 
tegral, sin (1/x) = 1. The function is not defined for x = 0. 


(Ὁ) For what values of z is sin (r/z) = —1? 


Evidently the graph oscillates an infinite number of times as 2 — 0. 
In any interval about the origin, no matter how small, the function oscil- 
lates between —1 and 1 an infinite number of times. We say that f(z) 
has no limit as x — 0 in this case. Note, however, that we can make f(z) 
arbitrarily small. In fact f(z) = 0 at infinitely many points. This em- 
phasizes again that for f(x) -- Las x — a, we must require that |f(x) — 1] 
remains arbitrarily small for all values of x sufficiently close to a. 

Finally, consider J sin (1/J), sketched in Fig. 7-14. We may imagine 
it constructed by distorting the graph of Fig. 7-13 so that it oscillates 
between the lines y = x and y = —x. Here f(0) is undefined, but the 
curve seems to be approaching the origin. Since |sin (7/x)| < 1, we have 
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. y 
—] | I 


I sin (/T) 


Πκ A : 


VAN 


Figure 7-14 


|x sin (7 /x)| < [1] < ε for all x such that [1] < €. Note that here f(x) 
is not always getting nearer its limit, but in spite of the oscillations, it 
stays within an arbitrarily small distance of its limit provided zx is close 
enough to its limiting value. 

It is convenient to visualize the situation in terms of a function f map- 
ping points on the axis of x into points on the axis of f(x), as in Fig. 7-15. 
We call the open interval (1, — e_L + e) defined by |f(z) — L| < ean 
epsilon neighborhood of L. Similarly, we call (a — 6_a-+ δ) a delta neigh- 
borhood of a. We call (a — 6_a-+ δ) /N {a}’defined by 0 < |x — αἱ < ὃ, 
a deleted delta neighborhood of a. If we are to have lim,.,, f(x) = L, we 
require that for x sufficiently near a, f(x) is arbitrarily near L. That is, 
given any e-neighborhood of Z (that is, given any e), there must be a 
deleted 6-neighborhood of a (determined by a δ) such that all points in 
the deleted neighborhood of a map into points in the neighborhood of L. 
Nothing is said here about the manner in which the points in the deleted 
6-neighborhood map into the e-neighborhood. If we imagine x as a point. 
moving toward a, the corresponding point f(z) may not always be moving 
toward L. But there is some deleted 6-neighborhood of a all of whose 
points map into the e-neighborhood of L. 

We can now give a precise definition. We require that for any given ε 
greater than 0, there must exzst a ὃ such that if x is in the deleted 6-neigh- 
borhood of a, then f(x) is in the e-neighborhood of ZL. Without the geomet- 
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if) —L)<e 
L—e L Loe 


F(x) 


O<jr—al< ὃ 


Figure 7-15 


ric imagery we have: given any positive ε there exists a ὃ such that 0 < 
lc — al < ὃ -- |f(z) — L| < ε. In symbols, 


(1) Def. [lim f(x) = 1] = [[(1ἡ - Lasx— al 
= [Vee > 0-365 >OAVLO < |x — al « ὃ > |f(~) — L| « εἰ. 


This definition is the outcome of the work of several generations of 
mathematicians. It may seem rather complicated, but all its parts are 
necessary, as 15 suggested by the preceding discussion. The idea is most 
easily grasped in terms of neighborhoods, and (1) is Just a reformulation 
in terms of the sentences defining the neighborhoods. 


(c) In the first example in which we argued that 2x — 2aas xz — a, the defini- 
tion is satisfied since we showed that ὃ = ¢/2 is such that 0 < |x — αἰ « ὃ -» 
[2 — 2a} < e«. Find another value for ὃ such that this sentence is satisfied. 
(ἃ) Rephrase this situation in terms of neighborhoods. (6) In the second 
example where we argued that 2x — 0 as x — 0, give two values for ὃ that sat- 
isfy the condition. (Ὁ Show that for ὃ = ε710,0 < [1] < ὃ — |xsin (#/z)| < 
e. (g) Prove that x? — 0 as 2 - 0 by finding a formula that gives the re- 
quired 6 in terms of «. (h) Show that 2azr -+ ἢ — 2az as h — 0 by exhibiting 
the 6 corresponding to any εἰ (4) Justify (2) and (8) by reference to the 
definition. 


(2) lim z= a. 
(3) | lim e(z) = ec. 


From the definition can be derived all the laws we need for calculating 
limits. In particular, 


(4) [lim f(@) = A A lim g@@) = Β] > [lim (f(x) + g(x)) = A + 8] 


(5) | - (lim (f(x)g(x)) = AB] 
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6) lim yx) = A x 0] = [lim (1/f(@)) = 1/4] 
(j) Why A +# Oin (6)? (Κ) State (4), (5), and (6) in words. 


The plausibility of (4) through (6) is easy to establish. For example, 
the hypothesis of (4) tells us that we can make f(x) arbitrarily close to A 
and g(x) artitrarily close to B by requiring x to be sufficiently close to a. 
Clearly, by making x close enough to a, we can then make f(x) + g(z) 
arbitrarily close to A ++ B. This suggests the way in which a proof could 
be constructed. Applying (1) to the hypothesis of (4) with (L:A, €:€/2) 
and (L:B, €:€/2), we see that there is a ὃ (call it δι) such that 0 < 
[5 — αἱ < 6, > |f(x) — A| < €/2 and a ὃ (call it 62) such that 0 < 
jc — al < 6. > |f(x) — B| < €/2. Let 63 = min {6), 62}. Then for 
0 < |x — αἱ < 63, both conclusions hold and hence |f(x) — A| + 
f(z) — Bl < €/2+ ¢€/2. But |f(z) + g(x) — (A+ B)| = |f@) —At+ 
σία) — Bi < |f(x) — Al + [σὰ — Bi. Hence 0 < |x — αἰ < 63 > 
f(z) + g(x) — (A + B)| < 6 which is. equivalent to the conclusion of 
(4) according to the definition (1). The other proofs we leave to optional 
problems. | 


(1) What laws and rules about existential quantifiers have we used here? 
(m) What law of inequalities did we use? (n) Derive (7), (8), and (9) from 
(2) through (6). 


(7) [lim f(z) = A] — [lim ef(x) = cA]. 
(8) [lim f(z) = A A lim σία) = B] = [lim [f(2) — 9@)] = A — Bh. 
(9) [lim Πα) = A A lim φᾷ) = Β γέ 0] — [lim [f(e)/g(@)] = A/B}. 


We have not required that f(a) = L whenever f(z) - L as Ζ -- a. 
Indeed, we have seen that f(a) may be different from L or undefined. 
However, when f(z) approaches a limit that is equal to f(a), we say that ἢ 
is continuous at a. More precisely, f is continuous at a if and only if 
lim,z.¢f(z) = f(a). Note that f may fail to be continuous at a point for 
any one or a combination of the following reasons: f(a) may not be de- 
fined, that is, a may not be in the domain of f; f(z) may not have a limit 
asx — a; or f(a) and lim f(z) may exist but be different. When a function 
is not continuous as a we say that it is discontinuous or that a is a point 
of discontinuity. The simplest kinds of discontinuities are at points where 
the graph of a function has a jump. 


(0) Cite examples of discontinuities from previous sections. (p) Is 1 sin (7/J) 
discontinuous at the origin? Answer the same question for the function 
7 sin (r/D) ὦ {(0, 0)}. 
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When a function is continuous at a point, its limit there can be 
evaluated very easily by the substitution (x:a) in f(z). Hence it is im- 
portant to know the continuity properties of functions. From (2) and (3) 
we see that the constant functions and the identity function are contin- 
uous at all points. By applying (4), (5), (7), (8), and (9) it is easy to show 
the following. 


If f and g are continuous at a, then cf, f + 9,f — g, and fg 
(10) are continuous ata. In addition, if g(a) # 0, f/g 1s continuous 
at a. 


(q) Show that all linear functions are continuous everywhere. (r) Show 
that I? is continuous at all points. 


When a function is continuous at all points of an interval it is said to be 
continuous on the interval. A function that is continuous on (—0o__ 0) is 
said to be continuous everywhere. 


(s) Show by induction that J" is continuous everywhere for n a natural 
number. (t) Show that all quadratic functions are continuous everywhere. 
(u) Use the theorems of this section to justify 2ar + ἢ — 2ax 85 ἢ — 0. 


PROBLEMS 


1. Sketch in the intervals (—2_—1/10) and (1/10 2) the function de- 
fined by f(z) = (—1)"!, where [1/z] is the largest integer less than or equal 
to 1/x. Does lim f(z) exist as x - 0? Is f(x) defined for x = 0? 

2. Do the same for the function defined by g(x) = xf(x) where f is defined 
in 1. What is lim, .ο g(x)? Is the function continuous at the origin? Could it 
be made so by adding a point to the graph? With the addition of this point, 
is there any interval containing the origin on which the function is continuous? 

3. Find lim,_,9 (mz + b)/(cx + d) assuming that d ¥ 0 and justify your 
conclusion. 

4, Find lim,_,9 (x?/z). 

5. State (11) in words and justify it. 


(11) [366 >0 AVzr0 < [xz — al « ὃ > f(x) = g(x)] — [lim f(z) = lim g(z)]. | 


6. Use (11) to find lim,_,; (17 — 2.5 -ἰ 1)/(a — 1). 


In Problems 7 through 14 find the limits and justify your procedure. 


2 
7. lim 1/2. 8. lim ad 3. τ΄ 2 
zl 2-2 x—-2 
ὁ μι 5 Ὁ 2 10. πὰ 2 
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3 3 
im π ττς 


11. lim (8+ “)" 12. 
z—1 h-0 
4 4 
#13. lim (1/h)[V2 - ἢ — Vo]. 14. lim ere. 
m—0 h—0 


15. Argue that all polynomials are continuous everywhere. Argue that all 
rational functions are continuous everywhere except at the zeros of their de- 
nominators. 

16. Show by induction that if f is continuous at a point, so is f” for n a natural 
number. 

*17. Show that if there is a neighborhood of a in which f(z) is bounded, and 
if lim;2g(z) = 0, then lim,_,. f(z)g(x) = 0. Express this in symbols. 

18. Show that [lim,,. f(z) = 1] <> Πὰς... (f(z) — L) = 0]. 

*%* 19. Give a geometric argument for lim, ,9 sinz = 0. 

20. Use the result in Problem 19 and (4-12-29) to show that sin is every- 
where continuous by showing that lim, ,, (sin z — sina) = 0. | 
% 21. Show that cos is continuous everywhere. 

% 22. Discuss the continuity of tan and cot. 

23. Is |Z| continuous at the origin? 

* 24. Show that if lim,,,f(z) = L, then there is a deleted neighborhood of a 
in which f(z) is bounded; that is, 16,420 < |x — al « ὃ > |f(x)| « M. 

* 25. Show that if lim f(z) exists and is positive, then there is a positive num- 
ber A and a deleted neighborhood of a in which f(x) > A. 

* 26. State and justify a result similar to Problem 25 when the limit is nonzero. 
% 27. Show that if limz,of(z) = L and lim,z,.f(x) = M, then M = L. This 
means that the limit, if it exists, is unique. 

% 28. Consider (sin θ),θ. Is it defined for 6 = 0? Does it have a limit as 
6 — 0? Use Fig. 7-16 to guess the limit and justify your results by a geometric 
argument. (Note that @ is in radians!) 


y 


FiGguRE 7-16 


* 29. Show that lim ae = 2. * 30. Find lim ᾿ 
- Ὁ 5-0 x 


-- COS & 


*31. Prove (5) by noting that |f(x)g(z) — AB| = [f(x)g(z) — Ag(x) + 
Ag(z) — AB| = |g(x)(f(z) — A) + A(g(z) — B)| < |g(a)| |f@) — Αἰ + [4] 
lg(z) — ΒΙ, and using the result of Problem 24. 
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32. Prove (6) by noting that |1/f(z) — 1/4 
using the result of Problem 26. 

*% 33. If f is continuous at a and g is continuous at f(a), then g(f) is continuous 
at a, that is, the composite of continuous functions is continuous. Prove this. 
* 34. Construct a function that is continuous at the origin but discontinuous 
everywhere else. 

*35. Show that if f(z) is bounded from above (below) in some neighborhood 
of a, then lim,_,, f(z) is not greater (less) than this upper (lower) bound. 

*36. Show that if lim σία) = lim A(z) as x — a and in some neighborhood of 
ag(z) < f(z) < A(z), then lim f(x) = lim g(x) as x — a. 


= |(1/Af(z))(f(@) — A)| and 


ANSWERS TO EXERCISES 


(a) 2. (Ὁ) 2/(4n+ 3) where n is integral. (ὁ) If 61 is a value of 6 for 
which the implication holds, then so is any smaller value of 6. (d) When # 
is in an (€/2)-neighborhood of a, 2z is in the e-neighborhood of 2a. (6) €/4, 
6/10. (ἢ |z| « {10 > lz] « ε. (zg) If « < Ve, then 2? « ε. (h) ε. 
(i) For (2), any ὃ < e. For (3), ¢(z) — ὁ = Ὁ, so any number will do for 6. 
(j) Otherwise the right member is undefined. (k) The sum and product of 
the limits is the limit of the sum and product. The limit of the reciprocal is the 
reciprocal of the limit, provided the limit is not zero. () (2-10-5) and the 
following discussion; (2-10-9). (ὦ) (3-9-11). (n) For example, for (7) 
apply (5)(g:c, B:c). (0) Step functions are discontinuous at the steps. 1/Z is 
discontinuous as ὦ = 0. (p) Yes, since it is undefined there. With the addi- 
tional point it is continuous. (q) Since they are sums of constant functions 
and constant multiples of the identity function. (1) Since J is continuous 
everywhere, so is 1.1 or 12 by (10). (s) For n = 1, we have the identity 
function. If Z* is continuous, so is ZJ-J* or I**+! by (10). (0) All quadratics 
are sums of constant functions and constant multiples of J and 12, all of which 
are continuous everywhere. (u) 2az-+ his linear in A. 


ANSWERS TO PROBLEMS 


1. No. No. 2. g(x) > 0 as x - 0, but g(0) is undefined. With (0,0) the 
function is continuous there. No. 3. b/d by use of (2), (7), (4), and (9). 5. If 
two functions coincide in a deleted neighborhood of a, then their limits (if any) 
as x — a must be the same. 7. 1. 9. 2/3. 11. 2°. 13. Multiply and divide 
by Vz-+h-+ Va and assume that 1112 is continuous. The continuity is not 
hard to prove by noting that /z — Va = (ἡ — a)/(Vx + Va). If|z — al < 
$:,2>a— 6, Vt > Va — &, and νῷ -ἰ Va > Va — δι - Va = M. 
Hence 1/(Wz - Va) < 1/M for |x — αἱ < δι. Now let 6 be the smaller of 
δ: and eM. Then for0 < |z — αἱ < ὃ, [νὰ — να] < |(eM)(1/M)| = € and 
hence Vz - Vaasx—a. 15. By Exercise (s) and theorem (10). 17. Apply 
(1). 23. Yes. 27. Assume M + LF and use the definition (1) to find a con- 
tradiction. 
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7-4 The derivative. We define the derwative Df of the function f by 


fle + ΩΣ = Siz). 


(1) Def. Df(z) = lim 


li 
h—0 
Note that Df is a function whose domain is the domain of f less those 
points (if any) where the limit does not exist. Thus D is a function that 
transforms a function f into its derivative Df. 

Now we can make the following precise definitions. The rate of change 
of f(x) with respect to x at a is Df(a). The slope of f at (a, f(a)) is Df(a). 
The tangent line to f at (a, f(a@)) 15 the line through (a, f(a)) having slope 
Df(a). Note that “slope” up to now has applied only to straight lines, but 
this definition means that we can speak of the slope of any curve at any 
point where the derivative exists. 


(a) Complete: The slope of a curve at a point is the slope of the 
(b) Justify (2). 


The equation of the line tangent to f at (xo, f(Xo)) 18 
(2) 
y — f(to) = [Df(xo)]( — Xo). 


(c) Apply (2) to find the equation of the tangent line to 812 at (—1, 3). 
(d) What is the slope of 3 at (0,3)? (6) What is the slope of 1275 at —5? 
(f) Review the discussion of rates of change and derivatives in Sections 7-1 
and 7-2. 


The number of situations in which the derivative is a useful tool is 
enormous. Let x be the output of a firm and y the total cost of producing zx. 
We have y = f(x), where f is the cost function. A possible graph is 
sketched in Fig. 7-17. In this case the slope of the curve at any point (the 


Y 


(x-+1, f(x) + Df(x)) 


ee (e-+1,f(e+1)) 


(+1, f(x)) 


Figure 7-17 
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derivative, the rate of change of cost with respect to output) is called the 
marginal cost. It represents the change in the cost per unit change in out- 
put if cost increases at a constant rate equal to the slope of the tangent 
line. Thus, in the figure at P, if output were increased 1 unit and the rate 
of increase remained constant (which it may not), then cost would in- 
crease by Df(x). Of course, we have distorted the scale in Fig. 7-17. On 
an actual cost curve a change of one unit in x would be very small, and 
f(x) and Df(x) would be almost the same as f(z + 1) and f(z + 1) — f(z). 


(z) What vector represents Df(x) in Fig. 7-17? (h) Describe the behavior 
of Df(x) in Fig. 7-17 as x increases. 

Copy Fig. 7-17 and sketch Df on the same graph. Do not worry about scale, 
but make Df(zx) increase or decrease as indicated by the figure. What do we call 
Df(x) where y = f(z) andy andzare: (i) distance, time, (7) velocity, time, 
(k) water ina tank, time, (1) altitude, horizontal distance? 


Definition (1) and the laws concerning limits can be used to find deriva- 
tives. However, the work is much simplified by using the following 
theorems. 


(3) De = 0. 

(4) DJ = 1. 

(5) D(f + g) = Df + Dg. 

(6) D(f-g) = f-Dg +g-0f. 
(7) Dic -f) = ες’ DF. 

(8) D(f — 9) = Df — Dg. 
(9) D(I/f) = —Df/f?. 

(10) D(f/a) = 3. Fe 
(11) DI*” = 15. 


(m) What is the geometric interpretation of (3)? (n) Complete the proof 
of (3) suggested by De(x) = lim (1/h)[(c)(x + Ah) — e(x)] = lim0/h = 0. 
(o) Similarly prove (4) by noting that Dr = lim [(ὦ + ὦ — x)/h]. 


To illustrate the usefulness of these theorems, let us find the derivative 
of 37? + 21 — 10 without and with their use. We have 


(12) D(3I? + 21 — 10)(z) 
= lim [3(a + h)? + 2(a@ +h) — 10 — 327 — 2x + 10]/h 
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(13) = lim (6hx -ἰ 3h? + 2h)/h 
(14) = lim (6x + 2 + 3h) 

(15) -- 6r+2 

(16) = (61 + 2)(2). 


The justification of (14) is that (6ha -+ 3h? + 2h)/h = 62 + 24+ 3h 
for all ἢ τέ 0, and the limit does not depend on the situation at h = 0. 
The same idea is embodied in (7-3-11). Step (15) is justified by the fact 
that 52 + 2 -+ 3h is linear in ἢ and hence continuous, so that we can 
merely substitute h = 0 in it. By using the theorems of this section the 
same result is obtained much more easily. 


(17) D(3I? + 21 — 10) = (3412 + D(2T) — DIO) ὁ) 

(18) = 3D(I*) + 201 — 0 (7), (3) 

(19) = 61 +2 (11)(n:2), (n:1). 
(p) What is the slope of 3J2-+ 21 — 10 at —3? Find the equation of the 


tangent line there. (q) Find (512 - 831) without and with the theorems 
of this section. 


To suggest the way in which laws (5) through (11) are proved, we give 
a proof of (5), leaving the others to the problems. 


(20) [Dif + 9)]() = iim [fl + h) + σία + h) — fle) — σα) ἢ (6) 


_ in [Fe +A) -- Κῶ σα - ἢ — ao) 
h-0 h h—0 h 
(7-2-4) 
(23) = Df(x) + Dg(z) (1). 
The variables “f,” “g,” “c” in the laws stand for functions. Indeed, their 


ranges are sets of functions having derivatives. The laws are identities 
relating functions. Of course, if we are interested in particular points we 
must insert an independent variable to get relations between images under 
functions and their derivatives. For example, (3) becomes De(z) = O(z). 
But since c(x) = cand O(x) = 0, we have ὃς = 0. Similarly (4) becomes 
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DI(x) = 1(x) or Dr = 1. Similarly (5) becomes D(f + g)(x) = Df(x) + 
Dg(x). The left member means the image of x under D(f + g). It is cus- 
tomary to write this D(f(x) + g(x)), so that (5) becomes D(f(x) + g(x)) = 
Df(x) + σία). Thus D(x? + 32) = D(I? + 3J)(2). 


(r) Rewrite (6) through (11) similarly in terms of the independent variable. 
(s) A body is moving so that its distance y is given by y = x” -+ 32, where z 
is the time. Find its velocity when zx = 3. (0) Find the slope of J* + 32+ 3 
atz = ἃ. 


There are many different notations for the derivative. Common syno- 
nyms for “Df(z)” are “f’(z),” “Dzf(x),” “df(x)/dz,” and “f,(x).” The last 
three are useful when an expression contains several variables. For ex- 
ample, if a person is asked to find the derivative of 2ax + 3y, he may 
not know whether the function is 2aZ + 3y, 2xI + 3y, or 2axr + 8]. 
These three possibilities may be indicated by writing D,(2ax -++ 3y), 
D,(2ax + 3y), D,(2ax + 3y). The first synonym, “f’(xz),” has the advan- 
tage of brevity. Usually derivatives are treated in terms of images so that 
one sees D(z? + x — 2) = 2x + lrather than D(J* + 1 — 2) = 27 - 1. 
Actually [D(7? + 1 — 2) = 21 + 1] © [D,(@? + a — 2) = 25 + 1]. 
The reader should use whichever notation is convenient. 

Still other notations are used in various contexts. Often Az is written 
for h and Ay for f(z + h) — f(x).. Then the average rate of change is 
Ay/Az, and Df(x) is the limit of this (if any) as Ax — 0. We call Az and 
Ay the increments of x and y respectively. Then Df(x) is expressed by 
ἀν ἄς, ν΄, Dy, or Dzy. 


Find: (u) D(z? — 522+ 3). (v) DUI? — 577+ 3. (w) D(I/D). (x) 
D(1/I?). 6) D(1/z?).  (z) D(w*(a — 855)). 


PROBLEMS 


In Problems 1 through 8 find the derivative by the definition and the laws 
of limits without using the laws of this section. Check by finding the derivative 
with the aid of the laws in this section. You may assume that (11) holds for all 
real values of n. 


1. DIA, 2. D(L/D). 

3. D(I'/2), 4. Dl — 1/I2). 

δ. D(I—}/2), 6. Dial? + bf + ὁ. 
7. Ὀ(δε — 2-3). 8. Die + 1)/(2 — D). 


9. Find D|J|. Suggestion: Consider αὶ > 0, x = 0,  < 0 separately and 
keep the geometric interpretation in mind. What is its domain? 
10. Describe the derivative of a step function. 
11. Do the same for a piecewise linear function. 
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%12. If the limit of (f(a + h) — f(a))/h as ἢ > O exists, that is, if Df(a) 
exists, we say that f is differentiable at a. Show that 


(24) (f ts differentiable at a) — (f ts continuous at a). 


13. Show that the converse of (24) is false. 

14. Show that a function cannot be differentiable at a point where it is 
discontinuous. 

15. Prove (6) by using the fact that f(ix+ θ λ)σία -ἰ- ἃ) — f(a)g(x) = 
fle + h)g(z + h) — flu + h)g(x) + φίλ + h) — f(x)g(x) = 
7 + Ajlg(e + h) — g(x)) + ga)[f(e + h) — F(z)]. 

16. Prove (7) from (3) and (6). 

17. Prove (8) from (5) and (7). 

18. Prove (9) by noting that 1/f(x-+h) — 1/f(z) = [f(z) --- [} - Ὁ} 
ethan h) = —[f(e-+ h) — f(a) /F(a) fe + A). 

. Prove (10) from (6) and (9). 

τ Prove D(if+g-+ σ) = Df+ Dg-+ Da. 

21. Prove (11) for positive integral n by induction. (We define 19 as 1.) 

22. Show that (11) holds for n = 0. 

23. Prove D(f-g-h) = f-g°-Dh+f-h:- Ὀσ -Ῥ σ΄ ἢ" Df. 
% 24. Prove (11) for negative integral n by using (10). 
% 25. Prove (25) and (26) by applying (1), (4-12-9), (4-12-10), and Problems 
28 and 30 in Section 7-3. 


(25) Dsin = cos. 
(26) Dcos = —sin. 


26. What is the slope of sin at the origin? at w/2? ata? at 3/2? at π᾿’ 43 


ANSWERS TO EXERCISES 


(a) tangent line at that point. (Ὁ) The point-slope form derived in Exer- 
cise (q) of Section 4-5. (c) (312) = 61, 62(—1) = —6, y—3 = 
—6(a - 1). (d) 0. (ὁ) (22/5)(—5) or —2. (g) The vector from (x + 1, f(x)) 
to (x + 1, f(z) + Df(x)). (h) Df(z) is initially positive and decreases for a 
time but remains positive. About halfway across the graph it begins and con- 
tinues to increase. (i) Velocity. (j) Acceleration. (k) Rate of flow. 
(1) Gradient. (m) That a horizontal line has zero slope. (n) c(z) = ὁ for 
all z. Insert “ἢ - 0” to complete the limit notation. (0) (2 τ΄ h— τ) = 
for all hh # 0. (p) D(372+ 21 — 10) = 61+ 2. (62+ 2)(—3) = —16. 
y— 11 = —16(2+ 83). (q) Ὀ(612- 8317) ) = lim (δώ + h)? + 38(@+ ἢ) — 
5a? — 32)/h = lim (10ha + 5h? + 3h)/A = lim (102 + 3+ 5h) = 105 -+ 8. 
D(512 + 32) = D(577) + D8) = 8012- 801] = 107 +3. 

(r) D(f(x)g(x)) = f(x)Dg(x) + g(x) Df(x). Def(x) = cDf(x). D(f(x) —g(x)) = 
Df(xz) — Dg(x). DU/f(x)) = —Df(z)/f?(z). D(F(x)/g(z)) = ἰσ() ὉΓ() — 
f(x) Dg(x)]/g2(x). Da” = nx*~!, (5) Df(z) = 24+ 3. Df(3) = 9. (Ὁ 9. 
(u) 8.2 — 10x. (Ὁ) 372—102. (w) —1/I?. (x) —2/I-3.) (y) —2273. 
(2) 22(1 — 9x?) + 2χ(α — 3x3) or 85:2 — 1524. 
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ANSWERS TO PROBLEMS 


1. 473. 3. (1/2)I-1/2. 5. (—1/2)I-3/2. 7. 5+ 3274. 9. The domain does 
not contain the origin. For 2 > 0, Diz] = 1 and for zx < 0, Dijz| = —1. 
Hence D|z| = |z|/z, DJZ| = [I|/Z. 11. Derivative is zero throughout its 
domain, which is the domain of the function less the endpoints of segments. 
13. |Z| is a counterexample. 15. This trick of adding and subtracting the same 
thing is very frequently useful. We have, with the aid of (7-3-4) and (7-3-5), 
Dif(x) - g(x)] = flim f(w + A)]- [lim [σία + h) — g(x)]/h] + g(a) lim [f(x + h) — 
f(x)|/h = f(x)Dg(x) + g(x)Df(x). Here we assume that as ὦ — 0, f(x + h) > 
f(x), that is, that f is continuous, which must be true anyway if Df 1s to exist, 
as indicated in (24). 16. D(ic-f) =¢-Df+f-De =¢-Df+0. 17. Dif—g) = 
Dif + (—1) +9] = Df+ (—1)Dg. | 

18. D{i/f(x)] = —lim [f(@ + h) — f(x)]/h- lim [1/f(z) " 7(α + h)] = —Df(z)/ 
[f(z)]?.. Note necessity of continuity of f again. 19. D[f/g] = D[f-(1/g)] = 
7: D(1/g) + (1/9) - Df = —fDg/g? + Df/g = (gDf — fDg)/g?. 21. Let p(n) = 
(11). Then p(1) = [Ὁ] = 1], p(n+ 1) = [DI"t! = (n+ 1)I"]. Assume p(n) 
by Hyp. Then, DI*+! = (1. 15) = 1: Ὁ15-ἃ 1*-DI τ 1.15π1-Ὲ I" -1= 
ni™+ I* = (n+ 1)1π,| 24. For n = —m with m > 0, DI-” = D(1/I”) = 
(1501 — 1. DI™)/12" = —mI*™—-1/P2" = —mI-""! = ni", 

25. Dsinx = lima_4o[sin (2 + h) — sin z]/h = lima_,o[sin x cos h + 
cos x sinh — sin 2]/h = lim;_,o[sin σίοοβ ἢ — 1)/h + coszsinh/h] = 
[sin z lim,_,o(cos h — 1)/h] + [cos z limpz_,o(sin h)/h]. Then see Problem 28 
in Section 7-3. 


7-5 Polynomials. Polynomials are functions that can be constructed 
from constant functions and the identity function by a finite number of 
operations of addition and multiplication. (See Section 5-12.) A poly- 
nomial has a defining formula of the form Shas cz", with c, ~ 0, and 
hence a name of the form )~'=(c,J'. We call πὶ the degree of the poly- 
nomial. Constant functions are polynomials of degree zero, linear func- 
tions are polynomials of degree one, and quadratic functions are poly- 
nomials of degree two. Third, fourth, and fifth degree polynomials are 
sometimes called cubics, quartics, and quintics. The c; are called coefficients. 

Since polynomials are continuous, their graphs have no breaks. Also, 


since 
(1) D> el ΞΣ 
i—0 i=l 


we see that the derivative of a polynomial is a polynomial. It follows that 
the derivative is continuous. This means that as the tangent line moves 
along the graph of the original polynomial it turns gradually without any 
sudden changes. In other words, the graph of a polynomial is smooth, 
without any sharp corners or other points at which there is no tangent. 
The fact that polynomials are both continuous and smooth justifies us 
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completely in graphing one by plotting selected points and joming them 
by a smooth curve. 


(a) Cite a continuous function whose derivative is not continuous and which 
has a sharp point at the place where its derivative does not exist. (b) Why 
are polynomials continuous everywhere? (c) Why does the summation in 
the right member of (1) run from 1 to n instead of from 0 to n? (d) Write (1) 
for the special case of a quadratic. (e) Justify (1). | 


An important subset of the polynomials is the power functions which 
have names of the form cl” with n a non-negative integer. We have 
already considered J and J”. Figure 7-18 shows I°. Since Ὁ15(0) = 
317(0) = 0, we know that the graph does level off at the origin as shown. 
This fact, together with our knowledge of continuity and smoothness, 
permits us to draw the curve with only a few points. We know that the 
curve always rises from left to right since Dr? = 32? > 0. 


Graph in (—2 2: (ὃ 1322, (g) —J%, = (h) 213. (i) ~ Why does 
Dz? > 0 imply that the curve is rising from left to right? 


Power functions of the form 125, that is, even power functions, are 
similar to 12, We show 12 and J* in Fig. 7-19. Since Dx?” = 2nx?"-! < 0 
for x < 0, = 0 for x = 0, and > O for x > 0, J*” is decreasing at the 
left of the origin, horizontal at the origin, and increasing to the right of 
the origin. Since x?” > 0, the graph never goes below the z-axis. 1525 is 
symmetric with respect to the y-axis, since x?” = (—zx)”". 

Functions of the form 77"*!, that is, odd power functions, have graphs 
similar to 13. Since Dx?"t+! = (2n + 1)x?" > 0, the curve is always in- 
aaron except where it flattens out at the origin. Since (—x)?"t! = 

z*"t1 the curve is symmetric with respect to the origin, ie., for each 
point a the curve there is another such that the segment joining the two 
passes through the origin and is bisected by it. 


Y 


11} pre 


[2 
13 


Figure 7-18 Figure 7-19 
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(j) Sketch 13 and 15 on the same axes. (k) For what positive values of 
cis a™tl < 2"? (Ὁ Review the discussion of symmetry in Sections 5-14 and 
5-15 and justify the above statements about symmetry. 


We have indicated that in order to graph a polynomial it is sufficient to 
plot key points and then join them by a smooth curve. Among the key 
points are those where the graph crosses the z-axis. If P is the polynomial, 
we seek the solutions of the equation P(z) = 0, that is, the zeros of P. 
We have simple methods for finding zeros of linear and quadratic 
functions. But for polynomials of higher degree, it is usually convenient 
and sometimes necessary to resort to approximation. All methods of 
approximation are based on finding a first approximation to a root, Le., 
an x for which f(z) = 0, and then improving the approximation by 
some systematic method. One of the most effective is called Newton’s 
method. 

To illustrate Newton’s method we consider f defined by f(x) = 2° — 
5a? - 6x — 3, which is similar to functions arising in genetics, economics, 
and many other fields. By trial we find that f(8) = —3 and f(4) = 5. 
Since f is continuous, it is evident that f must have a zero between 3 and 
4, as suggested in Fig. 7-20. It seems reasonable to take 3.5 as a first 
guess for the zero. We find (3.5) = —0.375, so this guess is not very far 
off. But how can we get a better approximation? In Fig. 7-21 we show 
the first approximation x; and the corresponding point (21, f(x1)) on the 
graph of f. In our problem (21, f(x1)) = (8.5, —0.375). We show also 
the tangent line to f at this point. Since the tangent is near the curve in 
a, small neighborhood of the point of tangency, we take as our second 
approximation the point (x2, 0) where this tangent meets the z-axis. 


ψ 


(σι, 7(41)} 


Figure 7-20 Fieure 7-21 
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It is easy to find this point, for we know by (7-4-2) that the equation 
of the tangent is y — f(x) = Df(x1)( — 21). To find ze, we substitute 
(y:0, 2:22) and get 


(2) Χ = 2, — f(x1)/Df(21). 


Since Df(x) = 3a? — 10x + 6 and Df(3.5) = 7.75, we find zr. = 3.5 — 
(—0.3875)/7.75 = 3.5 + 0.0484 = 3.55. The calculations were done on 
a slide rule, hence the approximate equalities. We find f(3.55) = 0.025 
and (3.54) = —0.054. Hence 3.55 seems a satisfactory approximation 
to two decimal places. | 

However, there is nothing to stop our continuing the process by using 
(2) with (71:72, 15:13) to find a third approximation. Indeed, we can find 
a sequence of approximations defined by the recursion formula 


(3) tn41 = ty — F(%n)/Df (an). 


If we wish to carry the calculation beyond one or two decimal places, it is 
convenient to have a desk calculator. With these or other computing aids, 
roots may be found to many decimal places very rapidly. 


(m) Let x1 = 3.6 and find x2 by (2) for the above problem. (n) Do the 
same for x3 = 4. (0) Do the same for x; = 8. (p) Use this method to 
find the larger one of the roots of x? + 2x — 4 = 0 to two decimal places and 
check your result by using the quadratic formula. 


* Theory of equations. The branch of mathematics that deals with finding 
the zeros of polynomial functions is called the theory of equations. The 
interested reader should consult treatises with this title. Two of the best 
known are by C. C. MacDuffee and by J. V. Uspensky. We give here in 
brief some of the most useful elementary theorems. 


(4) If P ts a polynomial and P(x) = (x — r)Q(x) + R, where 
Q is a polynomial and R is a number, then P(r) = R. 

This is the famous remainder theorem. It tells us that if we divide a poly- 

nomial by x — γ, then the constant remainder is the same as the image of 

r under the polynomial function. 


(q) Rewrite the equation in theorem (4) so as to indicate the way in which 
division is involved. 


The remainder theorem has an obvious application. If we wish to find 
P(r), we may do so by dividing by x — rand noting the remainder. Divi- 
sion can be systematized so that it is easier to carry through than sub- 
stitution and evaluation. 
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Another application is to derive the factor theorem. 


(5) A number r 18 a zero of the polynomial P af and only 1 Σ -ο 
is a factor of P(2x). 

A most useful application of this theorem is the following. Suppose 
that we have found one zero r; of P. Then we know that + — rr; 15 a 
factor, that is, P(x) = (x — r1)Q(x), where Q is a polynomial of degree 
one less than P. Now, in searching for the other roots of P, we need con- 
sider only the equation Q(z) = 0, called the reduced equation. As an ex- 
ample, let P be the polynomial used to illustrate Newton’s method and 
let γι be the zero we found approximately. We find by long division 


(6) «3 — 5x? — 6r — 3 = (a — 3.55)(x? — 1.455 + 0.85) + 0.02. 


If the root had been found exactly, the remainder term would have been 
zero instead of 0.02. Now, to find the other zeros we need consider only 
«2 — 1.452 + 0.85. However, the discriminant of this quadratic is less 
than zero and hence the function has no real zeros. Hence P has only 
the one real root we have approximated. 

‘Another useful conclusion that can be drawn from the factor theorem 15 


(7) κ A polynomial of degree n has at most n roots. 


This is easily seen as follows. By the fundamental theorem of algebra 
(see Problem 25 in Section 6-13), a polynomial P of degree n has at least 
one zero. Call it τῳ. Then P(x) = (x — r1)Q(x), where Q is of degree 
n — 1. But by the same theorem, Q has at least one zero. Calling it ra, 
we have P(x) = (x — σι) — re)R(x). Continuing in this way we find 
P(x) = (x — τι) — re)...(@ — tn) A, where A, γι, T2,..-, Tr are 
complex numbers. Since some of the r; may be the same, we know only 
that there are at least one, and at most ἢ, roots. 

Most equations that appear in applications do not have integral or 
rational roots. However, such simple equations often appear in textbooks 
and occasionally in real life. The following theorem is helpful in finding 
rational roots if they exist. | 


Tf a rational p/ q (where p and q are integers) is a zero of the 
(8) polynomial Dm —9 Ml*, ‘| where the m; are integers, then p must 
be a factor of mo and ῳ must. be a factor of mn. 


For example, we easily see that J? — 513 + 5I — 3 has no rational zeros 
because the only possibilities are 1, —1, 8, —3 and none of these satisfy 
the equation x? — 52% + 52 — 3 = 0. 
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(r) Use this to give a quick proof that ν 2 is irrational by showing that 12 — 
2 has no rational zeros. (s) List the possible roots of 30x* — 7x? + 1992x? -ἰ- 
21 = 0. (t) Find all roots of 2.3 — 5x2 — 9 = 0, and sketch the graph by 
plotting a few additional points. (u) Factor xt — 1 = 0 into linear factors. 


PROBLEMS 


In Problems 1 through 8, sketch the polynomial, showing its zeros. Find its 
derivative and sketch it on the same graph. Discuss the relation between the 
graphs. | 


ly = (x — 1)3. 2.y = (x — 1)2z. 

3. y = (x — La(x+ 1). 4. y = (27 + 22 — 1)(x? — 4). 
δ. y = (x2 + 22 — 1)(x?2 + 4). 6. y = a(x — 2)(x + 1)(@ — 8). 
7.y = —a(x — 1)3.Ψ 8. y = (a+ 2)2( — 3)2. 


9. Try finding the zero of the function in Problem 1 by Newton’s method, 

beginning with x; = 2. 

10. Do the same for Problem 7 starting with x1 = 2. 

11. Do the same for Problem 7 starting with x1 = 0.5. 

12. What happens in Newton’s method if we guess the root exactly at any 
stage? 

18. Show that the recurrence relation (2) becomes ρει = (tn + @/2,)/2 for 
solving x? — a = 0. Use this to calculate 1/2 to four decimal places. 
% 14. Work out a formula for finding principal cube roots, and generalize for 
nth roots. 


In Problems 15 through 20 find all zeros with the aid of the theorems of this 
section. Begin by testing for rational roots. When one root is found, use the 
reduced equation. Sketch the polynomials. 


*15. 135 — 617+ 61 — 1. 

*16. 2+ 572+ 61 — 8. 

*17, 14 — 413 — 7139 +4 84] — 24 = 0. 
W18. 7° — 5 = y. 

W19. χ,5 + 45 --  --άά4 = y. 

*20. 14 — 273 — 3724+ 474 4. 


%21. A function f for which f(z) = f(—z) is called an even function. What is 
the corresponding geometric property? Suggest an explanation of the term. 
Cite examples other than polynomials. 

% 22. A function for which f(z) = —f(—2) is called an odd function. Discuss 
as in Problem 21. 

% 23. Prove (4) on the assumption that for any polynomial P there exists a 
polynomial Q such that P(x) = (x — r)Q(x) + R where RF is a number. 

% 24. Prove (5) from (4). 

% 25. Prove (8). 

*%26. By making use of (4) written in the form P(zx)/(# — r) = Q(x) + 
P(r)/(z — r), show that DP(r) = Q(r). Illustrate. Restate with quantifiers 
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to make clear that the last equation is not an identity holding for a fixed Q 
and any r. 

% 27. Newton’s method is applicable to functions other than polynomials. Use 
it to find approximately the roots of s — 2sinz = 0. Begin by graphing 1 
and 2 sin to locate a first approximation. 

% 28. Show that if r is a zero of a polynomial with real coefficients, then so 
is its conjugate 7. 


ANSWERS TO EXERCISES 


(a) |Z]. (Ὁ) (7-8-10). (ὁ The derivative of the constant term in the left 
member is 0. The right member would be unchanged in value if the lower limit 
of summation were changed from1to0. (ἃ) D(AI#?-+ BI+C) = 2AI+ B. 
(e) From (7-4-11) and (7-4-7) we have D(AI") = nAI"~—1, which we apply 
to each term of the left member of (1). (f) Like Fig. 7-18, but each y half as 
large. (g) The mirror image in the z-axis of Fig. 7-18. (h) Each y twice 
that of Fig. 7-18. (i) Since Df is the slope of the tangent line, Df(z) > 0 
means that the tangent line is sloping up from left to right. See Fig. 4-27. 
(j) 15. should be below in (—o _—1) and (0 1) and above elsewhere. 
(k) (0 1). (m) 3.55. (n) 3.6. (o) 4. (q) P(x)/(a@ — τ) = Q(x) + 
R/(z — 1r). 

(r) The only possibilities are +2, +1. (8) +1, +38, 47, 41/2, 41/3, 
+1/5, +1/6, 41/10, +1/15, +3/2, +3/5, +3/10, +7/2, 47/3, +7/5, 
+7/6, +7/10, +7/15. (ὃ 8, (—-1 + ἐν 2ῶ8),4. (u) (@+)D@—-1) 
(2 + t)(x — ὃ). 

: ANSWERS TO PROBLEMS 


1 through 8. Check zeros by substitution. 1. Fig. 7-18 translated one unit 
to the right. 2. Tangent to z-axis at (1,0). 7. Tangent to z-axis at (1, 0). 
8. Tangent to z-axis at (—2,0) and (3,0). 13. Here f = 12 — a, Df = 2]. 
15. 1, (6 + V21)/2. 17. 1, 2, —3, 4. 19. Real roots are —4, 1; others are 
imaginary. 


7-6 Maxima and minima. Consider again the polynomial f = J? — 
517 + 61 — 3, one of whose zeros we found approximately by Newton’s 
method in Section 7-5. To graph f we may begin by finding a few points 
corresponding to simple values of x. In this way we get (—1, —15), 
(0, 5:59); (, =); (2, —~3), (3, —3), and (4, 5). 

(a) Make a large sketch showing these points and (8.55, 0) found in Sec- 
tion 7—5, use it to follow the discussion below, and complete it as you get addi- 
tional information. 


Apparently the graph rises from (—1, —15) to (0, —8) and to (1, —1), 
but how much farther? Does it cross the z-axis? If so it must turn down 
again and cross the axis a second time to get to (2, —3). Then, pre- 
sumably, it drops from (2, —3), but how far before turning up again? 
Does it continue to rise to the right of (3.55, 1)? To answer these questions 
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we use the derivative. Where the curve turns from going up to going down 
or from going down to going up it appears that it should have a horizontal 
tangent, that is, Df(z) = 0. [See (3) below.] But Df(z) = 3x7 — 10x + 
6= Ο δἱ χα -- (ὃ + V7)/3 = 2.55 or 0.78. Since f(2.55) = —3.6 and 
f(0.78) = - 0.89, we see that the highest point between (0, —3) and 
(2, —3) is (0.78, —0.89) and the lowest point between (2, —3) and 
(3, —3) is (2.55, —3.6). Since these are the only points where the curve 
is horizontal, we know that it cannot cross the z-axis except at the point 
found by Newton’s method above. This can be further checked by show- 
ing that Df(x) > Ο for x < 0.78 and for x > 2.55 and that Df(x) < 0 
for 0.78 « x# < 2.55. 


(b) Complete your graph of y = x? — 52*-+ 6x — 3. On the same axes 
sketch its derivative. (c) For what value of x is Df(z) minimum? How does 
this show up on the graph? 


We call a point where a function changes from increasing to decreasing 
or from decreasing to increasing an extremum. At an extremum of f, f(x) 
is either larger than at any nearby points, in which case it has what is 
called a relative maximum, or else f(x) is smaller than at any nearby 
points, in which case it has a relative minimum. We have seen that if a 
function is differentiable at an extremum, then it appears plausible that 
its derivative is zero there. This means that a good way to find extrema 
is to set the derivative equal to zero and solve. However, two cautions 
should be noted: 1. If the derivative fails to exist at a point, a function may 
have a high or low point there without having a zero derivative. An 
example is |Z|, which has a minimum at (0,0). 2. Even if the derivative 
exists and is zero, it does not necessarily follow that the poit is an ex- 
tremum. It may be simply a point at which f levels out momentarily, as in 
Fig. 7-18. In other words, at a point where the derivative exists, a zero 
derivative is a necessary but not a sufficient condition for an extremum. 


(d) Can the first case occur for polynomials? 


We call a point (x, f(x)) where Df(x) = 0 a critical point of f. At such 
points the tangent line is horizontal. To graph a function by finding its 
zeros and critical points, we begin by plotting easily found points, find the 
derivative and its zeros, plot the critical points, and determine the zeros 
of the original function, by approximation if necessary. These methods 
apply to other functions as well as to polynomials. 


(6) Use the above method to find the vertex of the quadratic function 
212 — 3I — 4 considered at the beginning of Section 5-8. (ἢ) Derive (5-8-4) 
and (5-8-5) by calculus. (g) What is the greatest possible number of extrema 
for a polynomial of degree n? (ἢ) Find the critical points of the function 
of Problem 15 in Section 7-5. 


7-6] MAXIMA AND MINIMA All 


The above methods are satisfactory for graphing simple polynomials. 
However, we need a more general and precise theory because problems of 
maxima and minima arise very frequently in mathematics and its applica- 
tions. For example, very often we are interested in doing something in 
the cheapest, quickest, or “best” way, and many laws of physics, chemistry, 
and biology are formulated in terms of least time, shortest distance, or 
minimum energy. 

We begin by distinguishing between absolute and relative maxima. The 
absolute maximum of a function f is the maximum of its range, that is, 
max [Rge (f)]. It is the largest value of f(x) for all values of x in the 
domain of f. Of course, some functions have absolute maxima and others 
do not. For example, the absolute maximum of the signum function is 1, 
since its range is just {—1,0,1}. But J, I”, and J—! have no maximum. 
The absolute minimum is defined similarly. 


(i) Define the absolute minimum and give examples. Review Section 3-6. 
(j) Find, or indicate the nonexistence of, min [1, ([Κ) of min (2-7 — 1), 
(1) of min {(z,y)|y = 22 +1A -- Φ 2< 1}. 


As suggested in Exercise (1), a function that has no maximum or mini- 
mum may acquire one if the domain is restricted. Indeed the following 
theorem can be proved from the properties of real numbers. 


If f is continuous on a closed interval (a__b), then f has an 
(1) absolute maximum and an absolute minimum in that interval; 
that ts, Jax € (a_b) A f(x) = max Rge [f ἢ (a_b) Χ Κα] 

and jez Ε (α δ) A f(x) = min Rge[f ἢ (α δ) X RI. 


(m) Show that it is necessary to require the interval in (1) to be closed by 
citing a function that is continuous on the open interval (0 1) but has no 
maximum there. (n) What are the max and min of 13 on the closed interval 
(—1 3)? (0) Why does 13 have no max on (—1_ 3)? 


We say that f has a relative maximum at a when there is neighborhood 
of a in which f(a) is the absolute maximum of f. A relative maximum is 
also called a local maximum because it is a maximum with respect to 
the other nearby values of f(x). Formally, 


(f has a relative maximum at a) 


(2) Def. 
= 3ε|]ὦ — αἰ < € > f(z) < f(a). 


(0) Formulate a similar definition of a relative minimum ata. (q) Define 
“extremum” in terms of “maximum” and “minimum.” 
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The above concepts are illustrated well by the function you sketched 
in Exercise (a). It has a local maximum at 0.78, a local minimum at 2.55, 
but no absolute maximum or minimum. However, if we restrict the do- 
main to (—1_ 4), the function has an absolute minimum of —15 and an 
absolute maximum of 5, both occurring at the endpoints of the interval. 


Indicate intervals in which the function of Exercise (a) has: (τ) an absolute 
maximum at 0.78 and an absolute minimum at 2.55, (8) an absolute maximum 
of —3, (0) an absolute minimum of —3. 


Finding extrema is very much facilitated by the following theorem. 


(3) If f zs differentiable at a and has an extremum at a, 
then Df(a) = 0. 


(u) Give a plausibility argument for (8). (v) Show that the converse is 
not true. (w) Show that a function may have a relative minimum without 
having a zero derivative at the point. (x) Show that an absolute maximum 
(minimum) is always a relative maximum (minimum). (y) Argue that (3) 
means that if a function is continuous on a closed interval, its extrema are at 
points where Df(a) = 0, at points where the function is not differentiable, or 
at the endpoints of the interval. 


Use of the second derivative. The above discussion tells us that Df(a) = 0 
is neither a necessary nor a sufficient condition for an extremum. When 
the function f is differentiable at a, Df(a) = Ο is a necessary condition, 
but it does not tell us whether f has a local minimum, local maximum, or 
neither. In many cases one can tell from inspection of the graph, but it 
would be good to have sufficient conditions. We now introduce the theory 
necessary for these. 

The operation Ὁ maps each differentiable function f into a function Df. 
If Df is also differentiable, we may apply Ὁ again to get D(Df), which we 
call the second derivative and write D*f. The reader should note immedi- 
ately that the exponent on D indicates composition of the operation D 
with itself. Thus D?f = (D[D])(f) and D?f ~ (Df). This is inconsistent 
with the previous use of exponents to indicate numerical squaring of func- 
tions. However, the usage is universal. The convention is that exponents 
on functions whose range consists of numbers indicate numerical squaring, 
while exponents on operations (functions) whose range consists of functions 
indicate composition. To find D?f, we simply differentiate f and then dif- 
ferentiate the result. For example, D?(J*) = D(DZ?) = D(2/) = 2 and 
p2(7-1) = D(— I~?) = 21.- 3. 

What is the geometrical interpretation of D*f(z)? Since Df(x) is the 
rate of change of f(x) or the slope of the curve f at (x, f(x)), D*f(x) is the 
rate of change of Df(x) or the rate of change of the slope. If y is distance 
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FIGURE 7-22 


and x time, Dy is the speed and D7y is the rate of change of speed, i.e., 
the acceleration. 


(z) Show that if a body falls so that its distance is given in terms of time 
by y = 16x”, then its acceleration is 32. 


Let us imagine the graph of f to be a roller coaster, as suggested in 
Fig. 7-22. A short segment of the tangent line becomes a car (with a 
short wheel base) moving along the curve. It carries a vector parallel to 
the tangent line, as indicated in the figure. For brevity we let Df(z) = m. 
At A, m > 0. Then m decreases as the car approaches B. At B, m = 0, 
and m continues to decrease to the right of B. But at C, m stops decreasing 
and starts increasing, that is, m has a minimum of C. This suggests that 
D?f(x) « 0 in the interval A to C and D?f(r) = Oat C. 

In an interval where D?f(x) < 0, the slope of the curve is decreasing 
and we say that the curve is concave downward. That is, the tangent line 
is rotating clockwise as it moves along the curve. It appears evident that 
at a point such as B, where Df(a) = 0 and D*f(a) < 0, f has a local 
maximum. Df(zx) > 0 immediately to the left, and Df(z) < 0 immedi- 
ately to the right. This means that f(x) increases as x approaches a from 
the left and decreases as it moves past. 


If Df(a) = 0 and D*f(a) « 0, then f has a relative maximum 
αἱ a. 


(4) 


(aa) State a similar theorem for D?f(a) > 0, and argue for it by considering 
the interval C to EF in the figure. (At a point where D2f(r) > 0, we say the 
curve is concave upward.) (bb) Show that the converse of (4) is false. (cc) II- 
lustrate the above results for the functions of Fig. 7-23 by finding the deriva- 
tives at the critical points. (dd) Find the local maximum and minimum of 
I? — 3]? — 61 + 8 without graphing. (ee) Prove (5-8-4) by (4). 


Theorems such as (4) are useful in finding extrema. However, it should 
be kept in mind that extrema may occur at the endpoints of an interval 
in which the function is defined, at points where the function does not 
have a derivative, and at points where the derivative exists and is zero. 
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Setting the derivative equal to zero detects only points of the third kind. 
The other two cases must be considered separately. 


(ff) At what points in Fig. 7-22 is there a relative maximum or minimum 
but no derivative? (gg) At what point does Df(x) = 0 without there being 
anextremum? (hh) Where is the absolute maximum of f? 


PROBLEMS 


In Problems 1 through 8, graph the function and its derivative. Find the 
critical points. 


(291? 37 ew. 2. 513 + 212 — 1. 
3. [4 — 72, . 4, 1-Δ, 

5. 1Χ(. --- 12). 6. 1Χ( --- D2. 

7. 1353Χ2 --ὐὐ. 8. ν = [2+ 7, 


9. In the graph of J—1, as x increases does y ever increase? Does the curve 

approach the horizontal as x gets very large? 

10. Show that for any real value of n such that n > 1, J” has a horizontal 
tangent at the point (0,0). What ifn < 1? 

11. What is the minimum of V2? — 212 + x? 
%12. Let R(x) and C(x) be the revenue and cost respectively of producing and 
selling x units of a commodity. We call DR(x) and DC(x) the marginal revenue 
and marginal cost. Show that the equality of marginal revenue and marginal 
cost is a necessary condition for maximum profit if we assume that R and C 
are differentiable. Write a sufficient condition. 
*%13. If u(z) is the utility of x, then economists call Du(x) the marginal utility. 
Why would you expect Du(x) > 0? What would Du(z) « Omean? Du(x) = 0? 
*% 14. Usually economists assume decreasing marginal utility. What does this 
mean in terms of derivatives? Why do they make the assumption? 
*15. Looking upon Ὁ as a function, what is Dom (D)? Rge (D)? The image of 
a function under D? 
*16. In statistics it is frequently of interest to have a measure of the differences 
between a set of numbers and some number that is representative of the whole 
set of numbers. Such representative numbers are often called measures of 
central tendency. A very commonly used measure of central tendency is the 
mean % defined by | 


_ 1 1=n 
(5) t= n = Li, 
where 21, Ζ2,. " , 2, are the numbers involved. Now let x be some number. 
We call x; — x the deviation of x; from x. We call 
(6) >, σι — a)” 


t=] 
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the sum of the squares of the deviations from x. Now for fixed 2,, (4) depends 
only upon x. Moreover, (6) defines a quadratic function of z, for 


=n 


(7) >») Ge x)” = x αἱ -- ὥχαι τ x”) 
i=1 i= 


(8) = na? —2 (Si ἡ). Ὁ Soak 

i=] ἐπεὶ 
The reason for the term nz? is that 22 is a constant relative to the summation, 
and we apply (6-7-19). Show that the minimum of (6) occurs when x = &. 
This result means that the sum of the squares of the deviations from the mean 
is smaller than from any other number. 
* 17. A point at which a curve changes its concavity, from concave downward 
to upward as at C in Fig. 7-22 or from upward to downward as at ἢ, is called 
a point of inflection. Inflection points of f are the extrema of Df. Argue that if 
(a, f(x)) is a point of inflection and D?f(a) exists, then D*f(a) = 0. 
*18. Is D?f(a) = 0 sufficient for an inflection? 
* 19. State sufficient conditions for an inflection. 
% 20. Find the points of inflection of the graph in Fig. 7-28. 


\ (0.78, 0) 


(3.55, 0) 
x 


(2.55, —3.6) 


FIGURE 7-23 
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% 21. Assuming (7-4-25) and (7-4-26), use (7-4-10) to show 
(9) | Dtan = sec”. 


* 22. What is the slope of the graph of sin at the origin? Where are its maxima 
and minima? its inflections? | 

*% 23. What is the slope of the graph of tan at the origin? 

* 24. Show that Ὁ sin (2J) = 2 cos (2/). 

*% 25. Show that a square is the rectangle that encloses the maximum area for a 
given perimeter. 

% 26. Suppose that a rancher wishes to enclose the maximum area by having 
a fence on three sides of a rectangle and a river on the fourth. If he has 1 mile 
of fence, what dimensions should he select? 

* 27. Find the shortest distance from the point (1,0) to the curve y? = Cz. 
(Suggestions: The distance will be minimum when its square is minimum. Do 
not forget endpoints, which are different depending on C. Draw Sketches!) 
% 28. The height above the ground at a time ¢ of a projectile fired at an angle @ 
with the horizontal and at an initial speed of vg is given by y = (vo sin #)t — 
(4)gt?. Find the time to reach the highest point and the height there. 

*29. The range of a projectile is given by R = (22 sin θ cos 6)/g. Show that 
the maximum range corresponds to 8 = 45°. 

*30. Economists usually assume that marginal utility decreases with in- 
creasing consumption. Express this in terms of derivatives. 

*31. A psychologist wrote: “The temporal course of adaptation is negatively 
accelerated, proceeding rapidly at first and gradually slowing in rate.” 
in terms of derivatives. 

*32. A sociologist wrote: “The proportion of Republican votes goes up with 
the socio-economic status of the voters.” State this in terms of derivatives. 
*%33. J. M. Keynes wrote in his famous General Theory of Employment, Interest, 
and Money: “The fundamental psychological law...is that men are disposed, 
as a rule and on the average, to increase their consumption as their income 
increases, but not by as much as the increase in their income.” Express this 
in terms of derivatives. 

* 34. Let x and y be the amounts of two commodities owned by an individual. 
An indifference curve is a set of points which all have the same utility to the 
individual. It is usually assumed that such curves are “negatively sloped.” 
What does this mean? It is also assumed that such curves are convex. What 
does this mean in terms of derivatives? Complete the following: Convexity of 
the indifference curves means that the more a consumer has of one commodity 
the _________ is the increment in it which is required to compensate for a decrease 
in the other. | 

*35. Show that f(z) has extrema at the same values of x as does f(z). 


ANSWERS TO EXERCISES 


(Ὁ) See Fig. 7-23. (ὦ Where the derivative of the derivative 6x — 10 is 
zero, that is, at (5/3, —7/3). It is the value of x where the original curve is 
dropping most rapidly, that is, has the smallest slope. (ἃ) No. (g) » — 1, 


Express 
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since derivative is of degree n — 1. (h) (3.41, —7.8), (1.59, —5.8). (i) The 
absolute minimum of a function is the minimum of itsrange. (1) 0. (k) None. 
(i) —5. (ἡ J-!. (ὦ) —!1 and 27. (0) There is no greatest number less 
than 27, but no number less than 27 is an upper bound of the values of x? for 
2 <3. (p) In (2) change “maximum” to “minimum” and < to >. (q) A 
relative maximum or minimum. 

(r) (( 8. (s) (τό 0)».᾽. (ὁ (0 22... (u) If the curve has a tangent, 
this tangent must be horizontal, otherwise the curve would be rising or falling 
and the point would be neither a maximum nor a minimum. (Note use of 
proof by contradiction.) (v) I? at origin. (w) [Z| at origin. (x) Immedi- 
ate from definitions. (y) (pA 4) r= |[~r>-~pV ~q). (2) Ὀ2(1612 = 
D(D16I7) = D(327) = 32. (aa) If Df(a) = 0 and D*f(a) > 0, then f has a 
relative minimum ata. (bb) —J*atorigin. (dd) D(z? — 327 — 64+ 8) = 
327 — 6 —~6 -ς 0.» 22 —22 —-2= 0027 =14V3. (ff) GF. 
(gg) H. (hh) At the right-hand endpoint. 


ANSWERS TO PROBLEMS 


4. No endpoints, maxima, or mimima. 5. Maximum at (0,1). 8. Minimum 
at (0,0). 9. Study behavior of D(J—!) as x increases. 11. Zero, since the prin- 
cipal square root can never be negative and the radical is zero for x = 1! No 
calculus needed here! 


7~7 Derivative of the composite. Suppose we wish to find the deriva- 
tive of (J + 2)}9. To use DJ” = nI”—' directly we would first have to 
expand (I + 2)}9, However, we see that (J + 2)} = I'J + 2] = 
(I + 2).I7'% That is, (1 + 2)}9 is the composite of J + 2 and 19, 


(a) Review Section 5-9 if necessary. 


Now, if we had a formula for finding the derivative of the composite of 
two functions, we could use it to find D(J + 2)'° directly. Such a formula 
exists. Indeed, we have the following law. 


(1) D(g[f]) = Dg[f] - Df. 


In words, the derivative of g-of-f is Dg-of-f times Df. In the above example, 
we have D(J!%Z + 2]) = (Ὁ10[1 + 2])-D(7 + 2) = 107°[7 4+ 2]-1 = 
10(4 + 2)°%. It is the purpose of this section to explain this law, which is 
called the chain rule. 

First we remind the reader of some facts about the composite. We 
recall that g[f] is defined by (g[f])(z) = g(f(z)), that is, the image under 
g{f] is the image under g of the image under f of x. To evaluate g(f(x)) we 
evaluate f(x), then g-of-f(x). The composite g[f] is not to be confused with 
the product g - f under which the image of x is g(x) - f(x). We use brackets 
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to indicate composition ard insert the dot for multiplication. But some 
writers use the ordinary parentheses or omit the dot; in such cases it is 
necessary to tell from the context which is meant. We have J?[J*] = 
(13)? — 13.135 = 7® but 17-7? = 1°. From (5-9-11) we have 


(2) j= ty) = 7. 


(b) Justify (2) by inserting the variable x. (0) Justify (3). (ἃ) Justify (4). 


(3) gif] = (ey) ly = g(f(a))}. 
(4) (f[Z]) (2) = f(a). 


The law (4) means that we may always get a name of a function by 
putting “J” in place of “x” in an appropriate defining formula. This is 
consistent with our notation for polynomials. For example, (15 — J)(x) = 
xz? — x. Also, 


(5) Af] = (gl ZDIFI. 


Note that we get a name for g[f] by putting a name for f in place of “J” in 
a name for g. For example, above we had 111 + 2] = (J + 2)!° and 
107°[7 + 2] = 10(7 + 2)9. 


(e) Justify (5). Review Section 5-12. (ἡ 1513 =? (ἡ) 13.158 =? 


To see the plausibility of (1), let us write it in terms of images. 


(6) D(g(f(x)) = Dg(f(x)) Df (a). 


Now, by definition, the left member 15 the limit as ἢ — 0 of [g(f(z -++ h)) — 
g(f(x))|/h. Let k = f(x + h) — f(x). Then the derivative we seek is the 
limit of [σ( (4) + k) — g(f(x))]/h. If k ¥ 0, this is the same as 
(7) g(f(z) + k) — gfe) . fe +h) — f(z) | 

k h 
Since the limit of the product is the product of the limits, the limit of (7) 


as h — 0 is the product of the limits of the two fractions. As h — 0, 
k — 0, and the limit of (7) is Dg(f(x)) - Df(z). 


(h) Why must we have k # 0? (i) Why is this not a proof of (6)? 
(1) Where did we assume that f is continuous? (k) If we use the notation 
df/dx for the derivative, (6) becomes 
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dg _ dg af 
dx df dx 


Would it be a proof of this to just “cancel” “df” in the right member? 
To illustrate the use of the chain rule, consider D(J? + 2)*. We have 


D(I? + 2)? = DUI*[I? + 2)) 
= D/7[I? + 2] - Ὀ(12 - 2) = 27[1* 4 2]- 21 
= 2(7?7 4+ 2).21 = 41(13 + 2) = 413 + 81. 


Without using the chain rule, we have D(J? + 2)? = D(I* + 41° + 4) = 
413 +- 81. The chain rule does not seem to be shorter here, but we have 
chosen a very simple example and our procedure for using the chain rule 
was very detailed. In practice we write (135 + 2)? = 2(1? + 2)- 21. 
We do this by thinking of the J? + 2 as though it were 7, and finding the 
derivative of (12 + 2)? “with respect to 7? + 2,” then multiplying by the 
derivative of J? + 2. For example, D(J? + 2) = 3(1? + 2)22]. 

(1) Verify this result by expanding (J? + 2)? and then differentiating it. 


(m) Justify (8). (ἡ Find ΡᾺ — J)* two ways. (0) Find Dsin [3/]. 
(p) Find Dsin 32. (q) Find D(I? — 1)'”. 


(8) D(f”) = ἡ "Df. 


* The binomial theorem. The following law is called the binomial theorem. 


ae tn nia” —*x’ . 
(9) Cag. = Σ, πη -- ἢϊ 


(r) Write out (9) for n = 1, 2, 8 and expand the right member in each case. 
Note that for? = O andi = n we get in the denominator 0!, which was defined 


as 1 in' (6-6-9). (5) Let (*) = nl/il(n — i)!. Show that (7) ἘΠῚ (") ΒΡ 
() τ ον τὼν (:) = n(n -- 1)(n — 3),8:. (t) Show that () = 
ι; i i): (u) Expand (# + 20)". 


To prove (9) we first note that (a + x)” is a polynomial of degree n; 
that is, 


(10) (ἃ ΞΕ ἃ} = x cx" 


4=0 


for some c;. This is fairly obvious, but it can be proved by induction. Now, 
to prove (9) we must prove that c; = nla”~*/i!(n — 7)! Since (10) is 
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an identity for properly chosen c;, we may apply (x:0), which yields a” = 
Co. But nta"—°/0!(n — 0)! = a”, so this result is as we wish. Now we 
differentiate both members of (10) to get 


(11) nia+x)" " = > te, 
i=l 


We omit the term with 7 = 0, since it vanishes. Now in (11) we apply 
(2:0) to get παῖ 1 = δι. Since nla”—!/1'(n — 1)! = na”—!, the desired 
result (9) holds for = 1. Now suppose we differentiate both members of 
(10) 7 times. We claim that the result is 


(12) (nl/(n — 20). (a+ 29 = YG — 20 e704 


This certainly holds for 7 = 1, since that is Just (11). Assuming that it 
holds for 2, it is easy to show that it holds for 7 + 1 by finding the deriva- 
tive of both members. Since (12) does not have meaning for 7 > n, this 
induction argument shows that (12) holds for7 < n. Setting (x:0) in (12) 
we find nla"~7/(n — 7)! = jle;/(j — 231, or c7 = nla®—7/jl(n — j)!, as 
we wished to prove. Hence (9) is proved by induction. 


(v) Write out (9), (10), and (11) for n = 3 without the summation notation. 
Determine co, c1, C2, c3, for the expansion of (a -+ 2})3 by differentiating your 
(10) successively and setting x equal to zero in the results. (w) Formulate 
the induction argument to prove that (a -+ x)” is a polynomial of the nth degree. 


PROBLEMS 


1. Show that the graph of (I — a)!® is tangent to the z-axis at (a, 0). 


In Problems 2 through 9 find each derivative by using the chain rule and 
verify by another method. 


2. D(2I — 1)?. 3. D(x? + 2)?. 4, D(x + a)”. 
5. Dia — x)?. *6. Ὁ cos?. *7. Dcos 2]. 
*8. Dsin?. 9. D(x? + 2axr + δ)2. 
In Problems 10 through 15 find the derivative by the aid of the chain rule. 
10. D(2z + a)}/?, 11. D(a? + 2?)1/2, * 12. Dsin (x? — 1). 
*13. Dsin®z. *14. Dsin?zx?. τ 15. Ὁ (sin? - 72). 


16. Prove that (a+ 2)? = a? + 2ar+ 2x? by setting + = 0, differentiating 
and setting z = 0 in the result, then differentiating again and setting + = 0 
in the result. 7 
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* 17. Give plausibility arguments for (13) and (14). 


(13) If f is continuous at a, and g is continuous at f(a), then 
g(f) ts continuous at a, and lim; .2g(f(x)) = g(f(a)). 


(14) If g ts continuous at limz,a f(x), then limz4a 9(f(x)) = 
g(limz,a f(x)). 


*18. Find lim,,2 (82? + 2r)° and justify your procedure. 

%*19. Complete the following: If g(f) is discontinuous at a, then —____.. 
*20. Find the slope of the tangent line to the graph of y = («2 — 3)? where 
x= 1. 

21. A projectile is moving in the xy-plane so that its z-coordinate is increasing 
at 10 ft/sec and its y-coordinate is increasing at 2 ft/sec. At what rate is its 
distance from the origin changing when x = y = 1? 

%& 22. A searchlight 10 miles from shore is rotating at the rate of one revolution 
per minute. At what rate is the light moving along the shore when the beam 
is perpendicular to the shore line? 

23. If a searchlight on the earth were rotating at the rate of 1 degree per min- 
ute, how fast would the light be moving on a perpendicular surface as far away 
as the sun, that is, at a distance of 93,000,000 miles? 

24. A cylindrical tank of radius 3 ft is being filled by water flowing in at the 
rate of 15 gallons per minute. How fast is the depth of the fluid increasing? 

25. Answer the same question for a tank in the shape of an inverted cone with 
height 10 ft and radius of base 2 ft. 


Expand the binomials by (9) in Problems 26 through 31. 


* 26. (a+ x)* *%27. (a+ 22)3 *% 28. (x — 2) 
29. (22 + 8υ)8 430. @+b+0)8 «31. (1 — 2)6 


* 32. Carry through the proof of (9) for n = 4. 

*33. What is the term involving «10 in the expansion of (1 + x)?8? 

% 34. Answer the same question for the term involving b® in the expansion of 
(ὃ — x)}°. 

#35. Use the binomial expansion to approximate (1.01)°°. 

* 36. If one dollar is placed at compound interest of 2% for 1000 years, to what 
will it amount? 

* 37. How long does it take money to double at 6%? 


38. Show that (,",)+(") = ("1"). 


ANSWERS TO EXERCISES 


(Ὁ) f(I(x)) = f(z); 1. 1)) = f(z). (e) Since (g[f])(x) = g(f(x)) by defini- 
tion. (dA) f[J =f. (ὁ 9 =g. () 7 (g) I (h) if & = 0, the 
first fraction is undefined. (i) Because k may be zero for some value οἱ h ¥ 0. 
(j) When we assumed k > 0 ash— 0. (k) No, in this alternative notation 
for the derivative we have not given any meaning to the symbols in the numera- 
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tor and denominator separately. (1) Ὀ(12-Ὁ 2)? = D(J®+ 6715- 1272+ 8) = 
615 + 2413 + 247 = 6114 + 4124.4) = 61(I2 + 222. (m) (1)(g:1”). 
(n) 4( — 18(—1). (0) 3 οοκ [37. (p) 8cos3x. (q) (1/2)(1? — 1)! 
(21) = [(12 — 1)-¥2, 


(r) Verify by multiplication. (5) For example, (7) = n!/0!(n — 0)! = 1 
(t) ni/il(n — ti)! = n!/(n — ἡ) ον — (n—7))! (ὦ) 10zty + ia ++ 


8: 

2,,3 4 5 om azz? 42 2 
80x~y? + 80xry* + 32y?. (v) (a+ x) ὉΠ] + —— “os zt a ατ΄ -ἴὉ' 
31 
or = a? + 8a2z -+ 8αχ2 + 23. (a+ 2)? = cot eye + cox? + e323. 
8(α + x)? = οἱ + 2coxr + 3e32?. co = αϑ; cy = 8a7?; co = 38a; cg = 1. 


(w) If (a+ 2)" is a polomial of degree n, (2+ αχ)ία -1- x)” = α(α -[- 2)"-+ 
x(a + x)” is of degree n - 1. 


ANSWERS TO PROBLEMS 


1. Letting f = (1 — a)!9-Df(a) = 10(a — a)? = 0. 8. 4x(x? + 2). 
5. —2(a — x). 7. —2sin 21. 9. 4(x -ἰ a)(x? + Qaxz + δ). 11. x(a? + 2?)—-1?, 
13. 3sin2x cosa. 15. 2sin- cos: 12-Ὁ sin? +27 = 21 sin (I cos-++ sin). 21. ἃ = 
(x? + γ2)"2, Did = (1/2)(2? + y?)—1/?(2eDe 4+ 2yDy) = (cD + yDy) 
(a? + y?)—1/2 = (10x + 2y) (x? + ν2) τ}, When x = y = 1, Did = 12/V/2. 
23. S = (9.3 X 107)0; Ds = (9.3 X 107)D,0 = (9.3 X 107) (=) απ δ 


25. The volume of water is given by V = x(2/5)22/3 = (π)775).5, where ὦ 
is the depth of liquid. D,V = (r/25)27Dz. DV = 15. Hence Dy = 
3757π:2. See Fig. 7-24. 


ion 


FIGuRE 7-24 
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7-8 Functions defined implicitly. By using the chain rule we can find 
the derivative of a function that is defined implicitly, provided we are able 
to find the derivatives of the functions involved in its definition. For ex- 
ample, suppose we wish to find the derivative of 112. This function is 
defined implicitly by y2= 2 Ay > 0. Letting f = I”, we have 
f? = I. Now we find the derivative of each member of this identity to 
obtain 2fDf = 1, from which we have Df = 1/2f = 1/21? = (1/2)J—""”. 
If we prefer to work in terms of images, we have y? = 2, Dzy? = Dz, 
2yD.y = 1, Dey = 1/2y = 1/2V¢ = (1/2)a—%". 


(a) Review Section 5-4 as necessary. (b) Obtain the same result by find- 
ing the derivative of each member of the identity (I'/*)? = I. (ὦ Prove 
that DI!/3 = (1/3)I-'3 by finding the derivative of both members of (J1/3)? = 
I. (d) Show that these results conform to (7-4-11). 


We can now prove that (7-4-11) holds for all positive rational powers. 
Let p and q be positive integers. Then 


(1) (Pita aay EP 
by (6-8-9). Differentiating both members, we find 
(2) g(1?!%)2—!py?/4 _ pl?” 


Note that we use (7-4-11) for πὶ a positive integer here. Dividing both 
members of (2) by the coefficient of DI?/%, we find 


(3) 0154 = (p/g)I?'*—, 


which is just (7-4-11)(n:p/q), as desired. 


(e) Carry through the algebra to verify that (3) follows from (2. (ἢ) (9) 
shows that (7-4-11) holds for all positive rationals. Show now that it holds 
for all rationals by letting r be a positive rational and completing 1" = 
DOLE): Shas 

Find the derivatives of: (g) 171-12, (h) 155, ὦ Ve—1, (j) V2? 4 2. 


Sometimes it is hard to find the derivative of a function directly, but 
fairly easy to find the derivative of its inverse. In such cases the following 
law is helpful. 


(4) | Df* = 1 ΡΠ], 


or, in terms of images, 


(5) Df*(x) = 1" ()}. 
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(k) Prove from (4) that Df = 1/Df*[f]. (Ὁ) Use (4) to show that DJ!” = 
(1/2)I-1/2, 


To prove (4) we write f(f*(2)) = « by (5-10-9). Finding the derivative 
of both members, we have 


(6) D(f[f*]) (x) = (DF[F*]) (a) - Df*¥(@) = 1. 


Hence Df*(x) = 1/(Df[f*])(~) = 1/Df(f*(x)), which is just (δ). 

To see the plausibility of (4) and (5), let y = distance and x = time, 
so that y = f(x) gives distance in terms of time and x = f*(y) gives 
time in terms of distance. Then Df(x) is the speed, i.e., the distance per 
unit time, and Df*(y) is the rate of change of time with distance, Le., 
the time per unit distance. Evidently one is the reciprocal of the other. 

Or, let k = fix + h) — f(x), so that f@+h) = f(z) +k =yt+k 
and f*(y+k)=—ath. Then Df(x) = limpz49 (K/h) and Df*(y) 
lim;+0 (h/k) = limao (h/k), since k — 0 and h — 0 together. But the 
limit of the reciprocal is the reciprocal of the limit, that is, lim (h/k) = 
1/lim (k/h). Hence Df*(y) = 1/Df(x) = 1/Df(f*(y)). This is just 
(5)(ury). 

The plausibility of (4) is still more evident if we adopt the frequently 
used A-notation. (See Section 7-4.) Then Df(x) = lim (Ay/Ar), and 
Df*(y) = lim (Ax/Ay). Again we see the reciprocal relation between 
the two. 

To illustrate the applications of the ideas of this section, consider a 
point moving around the circumference of the circle x? + y? = r?. Sup- 
pose that x and y depend on time in such a way that this equation is always 
satisfied. Then we have D,x” + Diy? = D,r*, which becomes 220,2 + 
2yDiy = 0 or Diyy/Dyx = —a2/y. In Fig. 7-25 we show the circle with the 


y 


FIGURE 7-25 
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vectors vz -- (D,v,0) and v, = (0, Dy), which are the time rates of 
change of x and of y respectively, or the velocities in the z- and y-direc- 
tions respectively. Their sum, v, is the velocity vector. The slope of the 
vector v is Dy/Daw = —x/y = —1/(y/x). Since y/z is the slope of the 
radius vector (zx, y), this shows that the vector v is perpendicular to the 
radius vector. Hence it is tangent to the circle at (x, y). This proves that 
the velocity of a point moving in a circle is in the direction of the tangent 
to its path. 

(m) Justify the assertion about perpendicularity. (n) Letting [νυ] = », 

lv.| = vz, [νυν] = v,, find v in terms of v, and v,. (0) Assuming that υ is the 
speed measured along the circumference, express it in terms of 6, 7, and J. 
(p) Letting D,6 = w, express vz, vy, and v in terms of w (w is called the angular 
velocity). 
"ἡ The chain rule is very useful when functions are defined implicitly and 
it is not convenient to solve explicitly for a defining formula. For example, 
the equation y? — 8x7y? + 3x°y + x + 100 = 0, if we limit x and y to 
real values, defines a function whose domain is the real numbers. To get 
an explicit formula would be awkward. (It could be done in this case, but 
no simple formula could be obtained for, say, a fifth degree equation in y.) 
However, we can easily find D,y. Finding the derivative of both members 
of the equation, and keeping in mind that y depends on x by some func- 
tional relation, we find 


(7) By?2y! — 16ry? — 162?yy’ + 15aty + 32°y’ + 1 = 0. 
(8) y’ = (16xy? — 15a4y — 1)/(8y? — 16x7y + 32°). 


Thus we can find a formula for the rate of change of y with respect to x 
without finding an explicit formula for y in terms of z! 

(q) Find y’ where 2y°+ xy*+ 525y-+ y/x —5 = 0. (ἡ Find y’ where 
x = Sin y. 

We handled these problems in terms of the variables and y. We 
could have dealt with them in terms of the functions directly by putting 
f in place of y and J in place of x, since the function f defined by a sentence 
s(x, y) must satisfy the sentence s(J,f) because of the conventions we 
are using for naming functions in terms of J. It is more usual to handle 
problems in terms of the variables when a convenient name for the func- 
tion is not available, i.e., when the function is defined only implicitly. 


PROBLEMS 


Find the derivatives in Problems 1 through 6 by applying (7-4-11) directly 
and by using the defining equation. 


1. I-;a2y = 1. 2. 17 2; χὴν = 1. 3. 11/5. 9° = ἃ, 
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Ah dea ye = LN GS 
δ. (1 -- DMsaety=alaAy> 0. 
θ. -([ -- 1212, ΣΈ y= 1A 


7. Show that as x gets very large in absolute value the slope of the parabola 
02. — a?y? = a*b? approaches b/a or —b/a, that is, the slope of the asymptote. 
8. Show that there are only two points where the slope of the graph of 
62x? + a?y? = a?b? is zero. Where does it have a vertical tangent? 
9. Find y’ where ay? + by + ¢ = x. Check by solving for y and finding the 
derivative of the result. 
*10. Show that there are at most two points where the graph of any conic has 
a given slope, unless the conic is degenerate. 
*11. Find Ὁ Aresin from the defining equation x = Sin y. 
%*%12. Find it by making use of (4). 
*%13. Treat Arccos similarly. 

14. Assuming that a function is defined by x7y? + 2ry2 - 18 = 0, find D,y. 
*15. Show that the velocity of a point moving along the curve given by y = 
f(x) 1s tangent to the curve. 

16. Find the derivative of 115 by using (4). 


In Problems 17 through 22 find the derivative of the function, assuming that 
f is an unknown function. 


17. 12f + 203. 18, fi/2 — 1348, 
19. 1,2 sin. 20. 34 — 31-|,ϑ 
21. 217 — If + f?. 22. sin [72]. 


23. Prove that the curves 21. -ἰ 3y+ 2° — zy? = 0 and 2y+ zx%y* — 
3x = Ὁ are perpendicular at the origin. 

24. Find a formula giving the equation of the tangent line to the ellipse 
6727 + αν = ab? at the point (zo, yo). 

25. Solve a similar problem for the hyperbola. 

26. We call cI—! an inverse variation. When y = cx—! we say that y varies 
inversely with x. We say also that y is inversely proportional to x with c the 
constant of proportionality. Since c(—x)—! = —cx!, an inverse variation is an 
odd function and its graph is symmetric with respect to the origin. The special 
case I—! is sketched in Fig. 7-26. Since DJ~! = —I~?, we see that the slope 
of I—! is always negative, that is, f(x) is always decreasing. 

Suppose that a company has an overhead of c. Then if y = the overhead per 
unit output, and « = output, y = cx—!. The function is graphed in Fig. 7-27. 
The horizontal line is the constant function giving overhead. We see that as 
production increases, y decreases toward zero. On the other hand, as x goes to- 
ward zero, overhead per unit increases without bound. Find the marginal unit 
overhead, D,(cz—1). Sketch it. Is it increasing or decreasing? 

27. Show that if (αι, y1) and (ze, y2) belong to cI~!, then ye/y1 = x1/x2 
and σι = 1272. 

28. Why is D(1/z) < 0? 

29. Is I~! everywhere continuous? 

30. Find the slope of 13 at (1, 1). 
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Figure 7-26 FIGURE 7-27 


*31. If the cost is given by y = mz-} - ὃ, then unit cost ῳ is given by αὶ = 
y/x = m+ b/x. Graph the unit cost function for m = 20, ὃ = 100. 

* 32. Let C be the total cost and C = x? -+ 100z-+ 100, where x is output. 
Graph y = C/z, the unit cost function. Also graph its derivative, the marginal 
unit cost function. 

*33. Let f be the function of Fig. 7-6. Graph f/J and its derivative. What is 
f/I if f is the cost function of a firm? What is f/I if f is the function giving the 
distance traveled by a car as in Section 7-2? 

* 34. Let f be the function graphed in Fig. 7-7. Sketch f/J and its derivative. 
Interpret your results in terms of cost and unit cost. 


ANSWERS TO EXERCISES 


(b) 201/2py1/2 = 1, therefore Ὁ1Π2 = 1/271/2 = (1/2)I-!/7.— (ce) First 
step is 3(I1/3)?D71/8 = 1. (ἃ) By appropriate substitutions in (7-4-11). 
(e) (1! αγ4-- = χρτρία. Then [?~-1]P/a—-» = χρίασὶ (ἢ =(I"D1 — 1DI*)/ 
[2 = —p]r-1/q2r = —I-7-1, (ἡ —(1/2)I-3/2. (ὦ) (8/8) 1-2. 

(i) (72) --- ἡ) τῶ, (j) x(a? + 2τ5, ἡ (f:f*). Then use law 
fee =f. (Ο DIP? = 1/DI°AT12] = 1.21{11|2] = 1/21"? = (1/2)I-"2, Here 
fe = [2 ff = 13. (m) (4-7-8). (n) v? = v2 + 22, (0) υ = γθ,0. 
(p) » = τω; vz = —vsin#d, v, = v cos §, (q) y = νά — x?y3 — 2526) / 
x(10xry* + 422y3 + 528+ 1). (r) 1 = (Cosy)y’;y/ = 1/Cosy = 1/4V 1 — 2”. 


ANSWERS TO PROBLEMS 


7. y = b2x/a®y = +b72/a?(b/a) (x? — α2)}"2 = 4(b/a)(2/(x? — a*)4/*). As 
x gets very large, 22 — a2 = 27. 17. 21} - If’ + 4ff’. 19. —f-* 
sin—2(2ff’ sin + 72 cos). 21. 41 —f — If’ + 3f7f’. 23. From the first 
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equation, 2+ 3y’ + δ: — y? — 3zy2y’ = 0 and y’ = (ὃ — d5a* — 2)/ 
(8 — 8zy”). From the second equation, 2y’ + 322y4 + 4r3y3y’ — 3 = 0 
and y/ = (3 — 827y4)/(2 + 42%y%). At the origin, the first has a slope of 
—2/3 and the second of 3/2. 


7-9 The antiderivative. According to Newton’s laws of motion, a force 
F acting on a body of mass m gives it an acceleration a according to the 
relation F = ma. Now a = Dy, where »v is the velocity measured along 
an axis in the line of action F, and ¢ is the time. Hence F = mDy. Let 
us use this to find a description of the action of a body falling freely from 
rest. For such a body the force of gravity is proportional to the mass. 
Letting the constant of proportionality be g, we have F = mg. Sub- 
stituting in the previously derived equation, we have mg = mD,w, or 


(1) . Div = 0. 


It is because of (1) that g is called the acceleration of gravity. 

To describe the motion we should like to find the relation between υ 
and ἐ, that is, some function f such that v = f(t). From (1) Df =g. A 
function with this property is σ1, so that f(é) = gt is a posszbility. Of 
course f(t) = gi + C for any C would do. However, we wish the body to 
be falling from rest, so that f(0) = 0. Hence we choose f(t) = gi and 
write 3 


(2) " = gt. 


But v = D,s, where s is the distance measured along some axis in the 
line of action of F. Hence we now have 


(3) Dis = gt. 


Here we wish to find 8 in terms of ¢. What function F has the property 
that F’(t) = gt or DF = gI? One possibility is (1,,2)σ 13. Also, for any 
C, (1/2)gI? + C has the same derivative. However, if we measure s 
from the point where ¢ = 0, we have | 


(4) s = (1/2)g¢?. 


This is the famous law of motion of a freely falling body, which the 
student may have seen in physics books, and which may have seemed 
mysterious. It follows quite inevitably from Newton’s axioms. Of course 
it is confirmed by experiments, which is one of the facts that makes 
Newton’s axioms acceptable. If the pound-foot-second system is used, 
measurements give g = 32 ft/sec/sec; if the gram-centimeter-second sys- 
tem is used, g = 980 cm/sec/sec. 
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(a) Suppose that the acceleration of ἃ body varies directly with time with a 
constant of proportionality k. Find the velocity and distance at time ἐ, as- 
suming that it starts from rest at i = 0. 


In (1) we have an equation which we wish to solve for an unknown 
function f. Indeed (1) is equivalent to the equation Df = g. To solve it 
we have to answer the question: What function has a derivative equal to 
a constant function g? Equations which involve derivatives of unknown 
functions are called differential equations. Equation (1) is an example. A 
slightly more complicated one 1s 


(5) ΟἿ) - 2Df(x) + 3f(x) = x”. 
Differential equations of the form 
(6) DF (x) = f(x) 


occur frequently in mathematics. Eq. (1) is a special case with (F:f, f:g). 
In previous sections we have been solving problems in which F is given 
and f is to be found. Here we are interested in the inverse problem, given 
the derivative f to find the function F. We call any solution of (6) for 
F an antiderivative of f. 


Find an antiderivative of: (b) 21, (c) mI+ ὃ, (d) 8312, Find a solu- 
tion of each of the following: (e) Df(xz) = 2? — 32, (ὃ Df(z) = —1/z?, 
(g) Df(z) = 1. 


Suppose that DF = f, so that F is an antiderivative of f. Then 
DF +C) = DF+ DC = DF+ 0 =f. 


Hence F + C is also an antiderivative. Hence, if a function has an anti- 
derivative, it has infinitely many formed by adding arbitrary constant 
functions to any one of them. But are there still other antiderivatives not 
obtainable by adding constants? The answer is no, according to the fol- 
lowing theorem. 


(7) (DF = DG) > (Ac F = G -Ὁ ὁ. 


In words, if two functions have the same derivative, i.e., are anti- 
derivatives of the same function, then they differ by a constant function. 
To prove this we need to develop some further theory, which is of great 
importance for other purposes as well. 

Consider the function pictured in Fig. 7-28. The slope of the secant 
line is [f(b) — f(a)]/(b — a). It appears obvious that there is some point 
in (α 6) at which the tangent line is parallel to this secant; i.e., there is 
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(a, f(a)) 


Figure 7-28 


some & (pronounced kseye) for which a < ἃ < ὃ and Df(£) = [f(b) — 
f(a)]/(6 — a). More generally, we state the law of the mean. 


(8) If f 1s continuous in (α ὃ) and ts differentiable in (ᾳα Ὁ), 


then 3ξ (a < ξ « b) A [f(b) = fla) + (ὃ — a)Df(é)]. 


This can be proved, but we give only some exercises to indicate its 
plausibility. 


(h) Prove (8) for quadratic functions by showing that the required point is 
actually the midpoint of the interval in this case. (Suggestion: Let the quadratic 
function be given by f(z) = Az? -+ Bxr+C rather than by az? - bz+ ¢ to 
avoid confusion with the letters giving the interval.) (1) What is & for a linear 
function? (j) Give an example to show that continuity in the closed interval 
is necessary. (k) Give an example to show that differentiability in the closed 
interval (α 6) is not necessary. (1) Give an example to show that differenti- 
ability in the open interval (a__6) is necessary. (m) Show that the velocity 
of a freely falling body at the middle of any time interval is equal to its average 
velocity during the whole time interval. 


We can now prove the following. 
(9) (Df = 0) > (Ace f = ©). 


In words, if the derivative of a function is everywhere zero, then the 
function is a constant function. Equivalently, 


[V2 Df(z) = 0] — [3c Vx f(x) = cl. 


This is easily proved from (8). For suppose, by hypothesis, that Df = 0, 
and let f(a) = ας where ais any number. Then by (8), 


f(b) = e+ (6 — a)Df(é) 
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for some ἕξ in (a__b) or in (b__a). But Df(z) is zero for any x. Hence 
f(b) = c. Since this argument holds for any b, f = ec. 


(n) In the above proof we have said nothing about the continuity or dif- 
ferentiability of f, as required by (8). Fill in these omissions. 


Now we can prove (7), for consider the function F — G. We have 
D(F — G) = DF — DG = 0 by hypothesis. Hence by (8), F — G= 
corF=G-+e! 


(o) Find a formula that gives all the solutions of Df = 81 - 1. 


We looked at D as an operation (function) in Section 7-6. What, then, 
is D*, the inverse of D? We have Ὁ = {(f,g)|g = Df}, where we use ἢ 
as the independent variable and g as the dependent variable. D maps 
each function f into its derivative. Then D* = {(,f)|g = Df}. Here 
D* maps each function g into its antiderivatives. Note the plural, since 
corresponding to each g there is an infinity of /’s. 

We see that D* is not a function according to our definition of this term. 
However, it is customary to view it as an operation that maps a function 
into the set of all its antiderivatives. More common symbols for D*g are 
D—1g and fg, and for D*g(x), D~*'g(x) and fg(x) dx. The “dx” serves 
merely to indicate the independent variable, like the “x” in “D,f(z).” 
The origin of the symbol f will be indicated in Section 7-12. D~‘g is used 
as an abbreviation for a formula that stands for any one of the antide- 
rivatives of g. Since D(D~1g) = g and D?(D~'g) = Dg, the negative 
exponent on D appears to behave like an exponent on numbers. However, 
it represents the inverse with respect to composition, not numerical 
reciprocation. Thus D~'g τέ 1/Dg. Sometimes the antiderivative is called 
the indefinite integral. 

The relation between D~—! and Ὁ is expressed formally as follows 


(10) Def. [D~"g = f] = {Df = 9]. 


When we speak of finding the antiderivative of a function g, we mean 
finding the solutions of Df = g, that is, finding a formula that gives all 
the antiderivatives of g. For example, Ὁ 11 = 12,2 + C, where C is an 
arbitrary constant function. Checking by (10), we find D(I?/2 + C) = 
27/2+0= 1. 

In terms of images, 


(11) [D~*g(x) = f(z)] = (Df) = 9@)]. 


Hence D~!(r) = x7/2+C, where C is an arbitrary constant, since 
D(x?/2 + C) = 22/2+0= x. It is essential to include the arbitrary 
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constant, because Df = g has many solutions. Thus, although D~*(r) = 
«2,2 according to (11), the right member gives only one of the possible 
values. 


Find: (p) D-*(2I), (q) ς᾽, (r) D'(mI + δ), (5) DIA2+ a+ 
327 + 2°), (Ὁ Dod — 2)3. 


If we wish to find Df(z), we may apply (7-4-1), or rules developed from 
it. But to find D~'g(x) no such direct procedure is available. Just as we 
divide by using our knowledge of multiplication, so we look for anti- 
derivatives by using our knowledge of derivatives. Since this kind of 
indirect searching can be tedious, mathematicians have prepared tables of 
antiderivatives, usually called tables of integrals. Among the most widely 
used are Table of Integrals and Other Mathematical Data, by H. B. Dwight; 
C.R.C. Standard Mathematical Tables, edited by C. D. Hodgman; A Short 
Table of Integrals, by B. O. Peirce, The Handbook of Mathematical Tables 
and Formulas, by R. S. Burington, and Handbook of Calculus, Difference 
and Differential Equations, by E. J. Cogan and R. Z. Norman. 

Suppose, for example, that we wish to solve the differential equation 
Df(x) = σία: + δ), 3. Offhand (or even not offhand!) it is not easy to 
think of an expression whose derivative is x/(ax + δ). However, we 
find on page 60 of Burington, 


x dx b 1 


(12) (ax + δ), ~~ 2a2(ax + b)? a?(ax + b) 


This means that the expression on the right is an antiderivative of 
z(ax + ὃ) 3. By (11), this is to say that the derivative of the right 
member equals x(az + b)~*. Let us check this. 


D[(b/2a”) (ax + δ). — a "(αὐ + δ). 1} 
= $2—!a—?(—2) (ax + δ), 3α — a~?(—1) (ax + b)~7a 


—ba!(ax + 0) 5 + a7! (ax + b)—3(ax + B) 
= (ax + b)~*(—ba~* + & + a~'b) = x(ax + δ) “ἃ 


as desired. Hence all the antiderivatives are given by the right member of 
(12) plus an arbitrary constant. 

Integral tables contain thousands of formulas such as (12). Because of 
(11) they may be read also as formulas for derivatives. There is also an 
extensive theory of how to find antiderivatives with or without such tables, 


Ω 
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which the reader may locate in calculus books under the heading “methods 
of integrations.” (See also Section 7-18.) 


(u) Prove that 


—— = -Varr*+e. 
Vax? + ¢ a 
PROBLEMS 


1. Show that constant functions are the only ones having zero derivative at 
every point. 

2. Show that straight lines are the only curves having the same slope at 
every point. 


In Problems 3 through 10 a function is given. Find the family of antideriva- 
tives in each case. 


3. I — 81’. 4,VI+1. 
ὅ; P+ 1, *6. sin. 

7. 18. - 3421. 8. 3 — 21-8, 
9, 0. 10. Τρ Ὁ 


11. Show that (72 + 1)? is not δὴ antiderivative of (12 + 1)!/”. 
12. Show that ~4dn DI” = 17. 


In Problems 13 through 16 f(z) is given. In each case find D~!f(z). 
18. αν. 14. x --- 8:3. 18: 1: 16. cos a. 


17. The slope of a certain curve at any one of its points (x, y) isz — 3. Find 
the family of possible curves. Find the curve if we know that it passes through 
the origin. 

18. The slope of a curve is x* — 1 and it passes through (2, —5). Find the 
curve. 

* 19. A body is moving in a straight line with an acceleration given by δέ — 2, 
where ¢ is the time. Find its velocity if it is motionless at ¢ = 0. Find its dis- 
tance from the position at ¢ = 0. 

* 20. Work Problem 19 with the initial velocity 20. 

%& 21. A boy throws a ball downward with an initial velocity of 20 ft/sec from 
the top of a building 100 ft high. When will it strike the ground? How much 
later would it strike the ground if he just dropped it? 

% 22. Show that a body falling freely with a velocity of vp and a position given 
by 80 measured from some origin in its line of motion at time ἐο will move ac- 
cording to the equation s = so + v(f — to) + (4)g(t — to)’. 

% 23. Show that »? = v2-+ 29(s — so) in the situation described in Problem 22. 
*24. Show that D?s = vD,v in the same context. (Hint: Chain rule.) 

* 25. Practice differentiation by verifying formulas in a table of integrals. 
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ANSWERS TO EXERCISES 


(a) v = kt?/2. (Ὁ) 12. (ce) mI#/2+ δ]. (dad) 13. (e) 23/3 — 82?2/2. 
(f) l/z. (g) x (h) 40?+ Bb+C = Aa*?+ Ba+C+ (b—a)QAE+ BO 
¢ = (a+ 6)/2. (i) Any number in (α δ). ()) sg in (0 1. ( [J in 
(0 1). (ὃ {Z| in(—1__1). (m) Since the distance is given by a quadratic, 
you may apply Exercise (ἢ). (ἢ) Since the derivative is zero everywhere, the 
function must be continuous and have a derivative. (0) 312,2 - 1+. 
(p) 27+cC. (q) 27+C. (ἡ mI?/2+01+C. (s) 251. -ἰ 2?/2+ 23 + 
φ 4- Ο. (Ὁ —(1 — x)?/3. (ἡ Differentiate the right member. 


ANSWERS TO PROBLEMS 


Always check antiderivatives by finding their derivatives! 11. The derivative 
is (3/2)(I7 + 1)!/2(27). We are not yet in a position to find a function whose 
derivative is (77+ 1)!/?. 17. Given by y = 27/2 — 3x+C. If it passes 
through origin, 0 = C. 


7~10 Measure and area. Often we are interested in the length, area, 
volume, mass, or some other measure of an object or set. Such measures 
tell us in some sense “how much” of something is present. For example, 
when confronted with a finite set of objects we might count them to get 
a measure of the size of the set. When confronted with a physical body we 
might be interested in its mass, its dimensions, or its volume. We might 
be interested in the length of a straight line or curve and in the area of a 
region enclosed by a curve, as suggested in Fig. 7-29. 


FIGuRE 7-29 


As suggested in Section 7-1, the problem of length and area is by no 
means simple for curves other than broken lines. We shall find it helpful 
to consider some properties common to all measures. To get at these, we 
consider the particular case of measuring finite sets by counting them. 
Imagine, then, a certain finite set together with all its subsets. Let M(A) 
be the measure of a subset A obtained by counting. Among the subsets 
is the null set @, and 


(1) τ ΜΙ) = 0. 
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For any other subset A, M(A) is a natural number. Hence 
(2) M(A) 2 0. 


From (6-3-2) we have 


(3) (AN B= ὁ) > [M(A UB) = M(A) + M(B)). 
Also, 
(4) (A cB) - [M(A) < M(B)). 


(a) Argue for (4). (Ὁ) When would the equality hold in the conclusion of 
this implication? 


The above properties are not surprising, but it 1s interesting that (1) 
through (4) hold also when we consider some other kinds of measure. 
For example, the mass of an empty set is zero (1), the mass of any body is 
a non-negative real number (2), the mass of a body consisting of two dis- 
joint bodies together is the sum of the masses (3), and the mass of a part 
of a body is not greater than the mass of the body (4). 

Or let us consider the set of all finite segments on a straight line. The 
measure consisting of the length of a segment satisfies (1) through (4) if 
we consider @ to be a degenerate interval. There are some additional 
properties of interest. For example, the length of a segment consisting of 
a single point is zero, so that @ is not the only set of points on the Jine with 
zero length. We may have set A a proper subset of set B and yet M(A) = 
M(B). For example, the length of (ᾳα δ) and (α δ) are the same since 
they differ by but a single point. Also we note that two congruent seg- 
ments have the same measure. 


Suppose we wish to extend the idea of measure to segments on curves other 
than straight lines or to sets other than intervals on a straight line. It would 
be natural to try to do this in such a way that (1) through (4) still would hold. 
In the following questions, assume that you are trying to define the measure of 
sets of points on a straight line so that the measure of an interval is its length 
and so that (1) through (4) hold. (ὁ) How would you define the measure of 
the segment (ab)? (d) Could you then prove that M(a_a) = 0? (ὁ) How 
would you define the measure of a set consisting of the union of two disjoint 
intervals? (f) What would be the measure of any finite set of points? 


Now let us consider in the same way a plane region and all rectangular 
regions in it. We define the area of a rectangle to be the product of its two 
dimensions. Then (1) through (4) hold if we consider the null set as a kind 
of degenerate rectangle. We should like to extend the concept of area by 
appropriate definitions to cover regions other than those bounded by 


436 CALCULUS [cHaP. 7 


A B A’ Ε B/ ΕἘ Ε΄’ b {’ hv ob [{“’ 
Figure 7—30 


rectangles. More precisely, we wish to define the area of any region en- 
closed by a curve in such a way that (1) through (4) hold, the area of a 
rectangle is the product of its dimensions, and the areas of two congruent 
figures are equal. Figure 7-30 suggests how we can derive formulas for 
the area of regions enclosed by parallelograms, triangles, and any curve 
consisting of straight lines. 


(g) Show how we get the formula for the area of a region bounded by a 
parallelogram by using (3) and the congruence assumption. (ἢ) Explain how 
the area for a triangular region is derived. 


Now we consider more general regions, bounded by curves other than 
straight lines. For definiteness, consider the region bounded by the 
interval (0 2) on the x-axis, the segment from (2,0) to (2, 4), and the 
arc of the graph of I? between (0, 0) and (2,4). The region is shaded in 
Fig. 7-31. To calculate its area is quite beyond our present methods, 
since it clearly cannot be divided up into triangles. However, consider the 
subset of our region shaded in Fig. 7-32. It is that portion of the region 
lying below the step function whose horizontal segments, each of length 
1/2, have their left ends on the curve at (0,0), (1/2, 1/4), (1,1), and 
(3/2, 9/4). We can easily calculate the area of this region by (8). It is 
(1/2)(1/4 + 1-+ 9/4). But since this region is a subset of the region in 
which we are interested, its area is no greater by (4). Letting A be the 
area sought and A, be the area of the union of rectangular regions, we 
have A, < A. Similarly, the area under the step function whose hori- 
zontal segments have their right endpoints on the curve (the upper bound- 
ary is given by a dotted line in Fig. 7-32) is (1/2)(1/4 + 1+ 9/4 + 4). 
Calling this area A,, we have A, > A. 


(i) Justify the formulas for 44 and 44... (j) Why are the notations 44 
and Aa (read “A sub 4 lower” and “A sub 4 upper”) appropriate? (k) How 
is (3) used? 


The significant thing about the above maneuver is that we have “boxed” 
A between two areas that can easily be found, that is, 4, < A < Ag. 
We have done this by dividing the interval (0 2) into subintervals and 
constructing on these subintervals rectangles, the area of whose union ap- 
proximates the area sought. By dividing the interval into a larger number 


7-10] MEASURE AND AREA 437 


y = 0 
Figure 7-31 FiGuRE 7-32 


of subintervals we might hope to enclose A as tightly as we wish between 
two areas. This follows because the tops of the rectangles would be 
closer to the curve if we made their bases smaller. 


(1) Draw a large figure similar to Fig. 7-32 but with eight subintervals of 
equal length. Calculate Ag and Ag. (τ) Find Aa and A4 as single fractions 
and find the difference 44 — 44. (n) If we approximate A by (44 + Aa)/2, 
what is the largest possible error we might make? (0) Do the last two exer- 
cises for the 8 subintervals. 


Now, let us imagine that we have divided the interval from the origin 
to (2, 0) into n congruent subintervals, each of length 2/n. The values of 
x at the points of division are then 0, 2/n, 4/n, 6/n, 8/n, ..., (2n — 2)/n, 
2. The heights of the rectangles are 0, 4/n”, 16/n”, 36/n?, ..., 4. We con- 
sider first the rectangles below the curve. The height of the 7th rectangle 
is (2i/n)? or 477/n?. Since its base is 2/n, its area is 82*/n®. Since ὦ runs 
here from 1 to n — 1, the area of the union of these rectangular regions is 


(5) | Doe 877 /n? = (8/n?) > ΠΣ 
i=l i=l 


Applying (6-7-23)(n:n — 1) to the summation, and letting A, stand for 
area given by (5), we have | 


(6) An = (8/n*)(n — 1)n(2n — 1)/6 
(7) = (8/3)(1 — 1/n)(1 — 1/2n). 


(p) Verify that the right members of (6) and (7) are synonymous. 
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If we work with the rectangles whose tops are above the curve, the 7th 
rectangle again has height 477/n? and area 877/n?, but now ὦ runs from 1 
ton. Calling the area of the union of these rectangular regions A,, we have 


8 An = μὴ Σ᾽ i? = (8/n)(n)(n + On + 0/6 
i=l 
(9) = (8/3)(1 + 1/n)(1 + 1/2n). 


It appears plausible that as we let n get very large the difference be- 
tween A, and A, gets quite small, so that they become very good ap- 
proximations to A. We wish now to define the area A in some way con- 
sistent with these ideas. To do this, consider the set of all A, for n a 
natural number. Clearly this set of real numbers is bounded from above, 
since they are all less than Ay, for example. Hence this set of numbers 
has a least upper bound by (6-9-6). We define the area A to be this least 
upper bound. An alternative definition might come from considering the 
set of all A,. It is bounded from below, so has a greatest lower bound. 
We could define A as this greatest lower bound. 

It is not hard to see that the least upper bound of the A,’s and the great- 
est lower bound of the A,’s ought to be the same. For as we let n get 
larger we see from (7) that A, gets larger. Indeed, for large enough n 
we can make 1/n and 1/2n so small that A, is as close as we like to 8/3. 
Similarly, as n gets very large we see from (9) that A, decreases and gets 
as close as desired to 8/3. Hence 8/3 is the lub, glb, and the desired area. 


(q) Use this method to find the exact area of the region bounded by the 
arc of I? from (1, 1) to (8, 9), the interval (1__8) on the z-axis, the segment 
from (1, 0) to (1, 1,), and the segment from (3, 0) to (3, 9). 


PROBLEMS 


In Problems 1 through 8 find 44, 44, An, An, and A by seeing what happens 
when 7 gets very large. In each case we are concerned with the area of the 
region bounded by the graph of f between (a, f(a)) and (0, f(b)) and the seg- 
ments joining (a, f(a)), (α, 0), (6,0), and (ὃ, f(6)), in that order. (See Fig. 
7-33). Draw figures carefully. 


ljf=P,a=0,b = 2f= 2,4 =2,b -- 8. 

3.f = I2,a = -- 1,0) = 1 4.f=I,a=0,6 = 2. 

ὅ. 7 Ξτ ]1,α Ξ 1,ὃ πὸ 8 6.f = 212, α = 0,6 = 2. 
7.f=P,a=0,b =c 8 f=mca=0,b =e. 

*9. Use Problem 13 in Section 6-7 to find the area defined by f = 13,α = 0, 


b= 1. 
10. Generalize for ἢ = I?,a = Ὁ, ὃ = c. 
11. Generalize Problem 7 to f = 12, a and ὃ unspecified. 
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b, f(b 
(b, f(6)) ἢ 


(α, 7(α)) 


(α, 0) (6, 0) 
FIGURE 7-33 


12. Find by the methods of this section the area of the region between the 
graph of mI + ὃ and the x-axis between the lines x = a andz = c. (Assume 
that γι, ὃ, a, and ¢ are such that this region lies entirely above the z-axis.) 
Check your result against the area found by formulas of plane geometry. 

13. How could the methods of this section be used to find the area of a figure 
such as that of Fig. 7-29? 


ANSWERS TO EXERICSES 


(a) If A is a proper subset of B, we would have members of B left over if we 
attempted to pairthem. (Ὁ) When A = B. (0) |b— al. (d) [α — αἱ -- 0. 
(6) As the sum of the measures. (ἢ) Zero. (g) In Fig. 7-30, A’D'E = 
BIC'F. (8) In Fig. 7-30, Ε΄ DF’ = ΟΡ ἢ“. Hence their areas are equal 
and equal to half that of the parallelogram. (i) The base of each rectangle 
has length 1/2. The heights are 0, 1/4, 1, 9/4, and 4. (7) They suggest the 
number of subdivisions. (k) We are adding the areas of disjoint regions. 
(1) 35/16, 51/16. (m) 7/4, 15/4, 2. (n) 1. (0) Difference is 1, largest 
error 1/2. (p) Recall that (1/a?)bed = (b/a)(c/a)(d/a). (q) 26/3. 


ANSWERS ΤῸ PROBLEMS 


1.9. 3.2/3. 5.4. 7. 63/3. 9.1/4. 11. 63/3 — a3/3. 13. Split 
the region by straight lines so that it appears as the union of disjoint subregions, 
each of which is similar to Fig. 7-33. Of course, we would need to know the 
functions giving the curve bounding each region on top. 


7-11 The definite integral. We wish to generalize the ideas of the last 
section in order to find general methods of calculating areas quickly with- 
out the tedious devices used in the last section. We wish to be able to 
deal with curves both rising and falling and crossing the z-axis. Moreover, 
it will become apparent that the generalization has many uses besides 
that of finding areas. 

We consider a function f continucie in the closed interval (a ὃ +). Par- 
tition the interval into n subintervals, by the points of division a = 2% < 
ἃ gS ea pe Os C= te ας δὴ ΞΞ ὃ, These sub- 
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intervals are not necessarily of equal length. Now f is continuous on each 
subinterval, since it is continuous on the entire interval. In the 7th sub- 
interval, (x;_1 2%;),f has an absolute maximum and an absolute minimum 
according to (7-6-1). Let these be ΜΝ; and m; respectively. 


(a) Review the discussion of continuity in Sections 7-3 and 7-6. (Ὁ) Com- 
piete: M; is the largest value of ________ for values of xin —______._ (ce) Com- 
plete: m; is ____. (d) To get a physical picture of the situation, let us 
imagine f(z) to be the rate of flow of water into a tank (say in gallons per minute) 
and x the time. Then what is (z;-1 2;)? What is (σι — 2;_1)? What is M,;? 
What is m;? 


We call this partition of the interval (a 6) into subintervals p. Then 
we form the sums J, and 7, defined by 


(1) Lp = > mile; — Xs_1) 
i=1 

(2) 1, ΞΞΞ > M x2; — ας...) 
i=l 


We call J, and 7, the lower and upper sums corresponding to p. 


(e) Referring back to the physical interpretation in Exercise (d), what are 
I, and I,? 


Now the function f has an absolute maximum and minimum on the 
whole interval (a__b). Let these be M and m respectively. It is easy to 
show that 


(3) mb—a)<Il,< I, < Mb — a). 


(ἢ) Show the first inequality. (g) Thethird. (h) Themiddleone. (i) Re- 
ferring to Exercise (d), what are m(b — a) and M(b — a)?. 


Consider the set of all values of J, for all possible partitions. Clearly this 
set of real numbers is bounded from above by M(b — a). Hence it has a 
least upper bound J. Similarly, the set of all upper sums is bounded from 
below by m(b — a). Hence it has a greatest lower bound 7. 


(j) Show that 1» < JandI<J,. κί) Show that J <7. (1) Refer- 
ring to Exercise (d), what are J and I? 


Now we are going to show that for f a continuous function (which is 
all we are considering here), J = 7. To do this we shall make use of the 
following theorem about continuous functions, which we state here with- 
out proof. 
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If f is continuous in the closed interval (α δ), then for any 
(4) positive € there is a ὃ such that in any subinterval of length less 
than ὃ, the maximum and minimum of f(x) differ by less than e. 


Suppose now that we are given any arbitrarily small e. We choose a 
partition of the interval (a__b) in which all the subintervals are of length 
less than the corresponding 6, whose existence is guaranteed by (4). Then 
for this partition, 


(5) Ty — Lp = Σ᾽ (Me — mae — 24-1) 
(6) < > ε(α; = Li—1) 
(7) < (b — αλ)ε. 


Now ὃ — ais fixed and € may be chosen arbitrarily small. Hence 7, — 
1» may be made arbitrarily small. But by the results of Exercises (7) and 
(k), Jp < 1 < I < T,. Hence the difference between 1 and J can be 
shown to be arbitrarily small. It follows that J — J = 0, since only 0 
is non-negative and smaller than any positive number. 

We now define the definite integral of f from a to b as 1 (or 1) when 
1 =T. We designate this definite integral by 


(8) [ "Ff οὗ [ "πῶ de. 


We prefer the first form, but the second is most commonly used. Its 
origin is as follows. In the above discussion let Az; = x; — 2;_1. Then 
the lower and upper sums (1) and (2) are sums of terms of the form 
f(x*) Δαν, where x* is the value of x in the 7th interval that yields the 
minimum or maximum. The f is an elongated S suggesting “sum,” and 
the dx suggests the Az;. The reason that the f is used also to designate the 
antiderivative will be explained in Section 7-12. 


(m) Suppose that f(z) is the velocity of a moving body and 2 is the time. - 
Interpret (αι. 1 σῷ 27 — 2;-1, mi, Mi, each term in (3), J, and I. (n) Do 
the same letting f(z) be the rate of growth of national income and the time. 
(0) Do the same if we let f(z) be the y-coordinate of a point on the graph of 
f where f(z) > 0 in (α δ). Relate the symbols of this section to those of 
Section 7-10. 


In the next section we shall present a relatively easy way of evaluating 
definite integrals, but in this one we are interested in meaning rather than 
calculation. For this purpose we now derive another expression for the def- 
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inite integral. As above, let f be a continuous function in (a_b). We 
choose the special partition into intervals of equal length, (b — a)/n. 
Then consider the sum 


(9) SoS fa + + i(b — a)/n)(b — a)/n. 


iM 


Here each term is the product of the length of the intervals, (ὃ — a)/n, 
and the function evaluated at the right-hand end of the interval. Clearly 
S, lies between the lower and upper sums corresponding to this partition. 


(p) Why? 


Now suppose we let n get larger. It appears that the corresponding 
upper and lower sums will come closer together and approach the integral 
I. Hence as n gets larger, S, — the definite integral. We write 


(10) " f = lim Sila + i(b — a)/n)(b — a)/n. 
a ne Gy 


We have not yet defined “lim,_,..” This is done as follows. 


(11) Def. limz 0 f(x) = 1] 
=Vee>O0-INe«>N > [1(]) -- Π] « ε. 


Roughly speaking, this says that “the limit of f(a) as x approaches infinity 
is L” means, by definition, that we can make f(x) arbitrarily close to L 
by taking x large enough. Our proof that J = 7 showed also that 


LIM n—+00 Ln Ξ-- LiM p-+20 is = 7 ; 


where J, and [,, are the sums corresponding to the partition of (9). Since 
In < Sn < T,, by taking n large enough we can make the summation in 
(10) differ from the integral by an arbitrarily small amount. Hence (10) 
holds. 

It is not necessary that we take the intervals to be congruent nor 15 it 
necessary to evaluate f(x) at the right endpoint of each interval. Suppose 
that p is an arbitrary partition such as we introduced at the beginning of 
this section, and let χσί be any point in the 7th interval, that is, x;_1 < 
rv, < x; Then we have 


(12) [ f = lim x fe xi ~ 24-1). 


NR i=l 
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Here it is understood that as πὶ — o, we form the corresponding parti- 
tions so that the maximum length of the intervals in each partition ap- 
proaches zero. 


(q) Argue for (12). (r) Why is it not surprising that when we let the num- 
ber of intervals get large and their lengths get small, we get the same result 
whether we evaluate f(z) at the left end of each interval, the right end, the 
place where f(x) is maximum, the place where it is minimum, or anywhere else 
in the interval? 


PROBLEMS 


1. In Fig. 7-34(a) divide the interval (a__}) into rour congruent intervals 
and draw verticals at the points of division. Draw the rectangles corresponding 
to (1). | 

2. Copy Fig. 7-34(a) and draw the rectangle corresponding to (2). 

3. On your drawings for Problems 1 and 2 show rectangles whose areas are 
m(b — a) and M(b — a) and explain how your drawing makes (3) appear 
plausible. | 

4. Make another copy of Fig. 7~-34(a) and sketch the rectangles corresponding 
to (9). 

5. Make another similar drawing illustrating (12). 

6. Carry through Problems 1 through 5 for Fig. 7~34(b). 

7. Having defined the definite integral precisely, we can now use it to define 
the area of a region bounded by the graph of a function, the z-axis, and vertical 
lines. In fact we define the area of the region bounded by the graph of f between . 


f 


(a) 


(b) 
FiguRE 7-34 
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Figure 7-35 


(a, f(a)), (6, f(6)), and the segments joining (a, f(a)), (a, 0), (ὦ, 0), and (6, f(®)), 
in that order, where a < ὃ and f(z) > Ὁ for all z in (a_b), as f?f. Suppose 
we found the integral in the case illustrated in Fig. 7-34(b). Why would we 
fail to get the area between the graph of f, the z-axis, and the lines z = a and 
zs = δ What would we get? 

8. Argue that f° |f! gives the area of the region bounded by the graph of f, 
the z-axis, and the lines z = a and x = ὃ regardless of the sign of f(x) in the 
interval. 

9. Suppose we wish the area of the region graphed in Fig. 7-35? How is 
this area related to f° f? 

10. Suppose f(é) is the velocity at time ¢ along a straight line. Does f 2 f 
represent the distance traveled in the time from a to ὃ, or does it represent the 
net distance traveled, i.e., the distance traveled in the positive direction less the 
distance traveled in the negative direction? Why? 

11. In each part of Fig. 7-36 evaluate f_}f, where the broken line is the 
graph of f. 

12. Suppose that f(t) is the rate of growth of population, where population 
may be increasing at some times and decreasing at others. What does [ : f 
represent? | , 

13. Discuss (10) and (12) in terms of the physical image of Exercise (d). 


ANSWERS TO EXERCISES 


(b) f(z); the {ἢ interval. (c) the smallest value of f(z) in the 7th interval. 
(d) The ith time interval. The length of the zth time interval. The largest rate 
of flow during the 7th time interval. The smallest rate of flow during that in- 
terval. (6) The total flow calculated on the assumption that in each interval 
the flow was constant and equal to the smallest actual rate of flow during that 
interval. Same, with (smallest:largest). (f) We make each term in (1) no 
larger if we put m in place of m;. Then m is a common term and the expression 
becomes m times the sum of the lengths of the intervals, that is, m(b — a). 
(g) Similarly. (ἢ) Since each term in (1) is not larger than the corresponding 
term in (2). (i) The total flow if the rate of flow had remained constant at 
the value γι. Same but (m:M). 

(1) Any member of a set is no greater than the least upper bound of the set. 
(k) Since we can find an element in a set arbitrarily close to the least upper 
bound (greatest lower bound) of the set, J > J would require some upper sum 
to be smaller than some lower sum. But this cannot be, for consider any [0 
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(—1/2, 1/2) 


(1/2, —1/2) 
(b) 
y 
(1, 1/2) 


(1/2, —1) 
(c) (d) 
FIGURE 7-36 


and I,. Let r be the partition formed by using the points of division of both p 
and g. Then 1, < I, by (3). But Jp < J;, because the minima in the new 
intervals cannot be smaller than those in the old. Similarly 7, > 7,. Hence 
Ip < Tq. (Ὁ The total flow. (τι) Interval of time, elapsed time in interval, 
smallest velocity in 7th interval, largest, distance if velocity constantly equal to 
smallest, distance if in each interval velocity had been equal to smallest velocity 
in that interval, same but largest velocity in each interval, same but largest 
velocity over entire period, actual net distance moved. 

(n) Time interval, length of time interval, smallest and largest rate of growth 
in ith interval, net growth calculated at minimum rate over entire period, 
minimum in each interval, maximum in each interval, maximum over entire 
period, and actual net growth. (ο) Interval on z-axis, length of same, area 
of rectangle of base the total interval and height equal to minimum of f in in- 
terval, area of rectangles below curve, area of rectangles above, area of rectangle 
with base the interval and altitude the maximum of f in the interval, the actual 
area. (p) Because the value of the function at any point in the 7th interval 
must lie between the maximum and minimum there. (q) The argument for 
(10) holds. (1) As interval gets small enough it makes little difference where 
we evaluate f(x), provided it is continuous! 
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ANSWERS TO PROBLEMS 

1. The top of each rectangle should not cross the curve but should touch 
it at least once, and no part of the curve should be below it in that interval. 
3. These rectangles are those we could get with just one subinterval! 4. Each 
rectangle touches the curve at the right endpoint of the interval regardless of 
the nature of the curve in the interval. 7. We would get the area of the part 
above the z-axis less the area of the part below. 9. It is the negative of the 
integral. 11. (a) 1, (b) 0, (ὁ) —3/2, (d) —1/2. 


7-12 The fundamental theorem of integral calculus. The theorem after 
which this section is named will enable us to evaluate many definite 
integrals with considerable ease. It may be stated for the case of contin- 
uous functions as follows. 

If f is continuous in (a__b), then 


(1) [ γ- D7 4b) — D7 f(a). 


This says that the definite integral of f from a to ὃ is equal to an anti- 
derivative of f evaluated at ὃ less the same antiderivative evaluated at a. 
It is sometimes stated in the following equivalent form. 


b 
(2) [ f = F(b) — F(a), where DF = f. 
(a) Review Section 7-9. (Ὁ) Why would it make no difference which anti- 
derivative were used in (1) or (2)? . 


To illustrate the use of the theorem, we consider again the area of Fig. 
7-31. Here f = 15. By the fundamental theorem, 


(3) [ P= αἰ γὼ — αὐ) = 8/2. 


(c) Similarly use the fundamental theorem to do exercise (q) in Section 7-10. 
(d) Verify some other results in the problems of Section 7-10. (0) Do 
Problem 11 in 7-10. 


To prove the fundamental theorem we shall need some properties of 
the definite integral. In 


(4) i ae i [ΤῸ az 


we call a the lower limit of integration, b the upper limit of integration, and 
f or f(x) the integrand. (Note that this use of “limit” has nothing to do 
with the meaning in Section 7-3.) It is evident that the value of (4) de- 
pends only on a, ὃ, and f. The letter z in the second form is a dummy, 
since the value of (1) is unchanged if we replace it by any other letter. 
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This is evident from the definitions in Section 7-11, since the upper and 
lower sums depend only on the images of the z; under f. The dummy 2 
serves merely as a suggestive indication of the independent variable. 
Hence it is natural, and increasingly common, to omit “(x)” and “dz,” 
as we have done in the first form. 


6) [Κα - fi soa. 


In Section 7-11 we assumed a < ὃ. The meaning of (1) is extended by 
the following definitions. 


(6) Def. [t= am ae 
(7) Def. [ f=0. 


One way of looking at (6) is to note that if we integrate from ὃ to a with 
b > a, the Az; in the lower and upper sums are negative. 


(Ὁ Show that (7) is essential if (6) is to hold when a = ὁ. (g) Verbalize 
(6) in terms of upper and lower limits. (h) Interpret (7) in terms of areas. 


We assume as before that fis continuous. Let m and M be the glb and 
lub respectively of the range of fin (ᾳα δ). Then 


b 
(8) mb —a) < f f < μῷ -- a). 


Geometrically this means that the area under f is less than that of the 
rectangle whose height is M and greater than the rectangle whose height 
is m, as suggested in Fig. 7-37. Analytically, it follows from (7-11-3) and 
the definition of the integral as J (or 7). 

It is intuitively evident from Fig. 7-37 that there is some rectangle on 
the base (ᾳ ὃ) whose area is exactly equal to the integral; that is, 


(9) Hm < HSM Af f= HO -- ὦ. 


(a, 0) (ξ, 0) (6, 0) 
FIGuRE 7-37. 
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The top of this rectangle is a line parallel to the z-axis between the lines 
y = mand y = M, as suggested in the figure. Of course (9) is true in- 
dependently of geometrical argument since H = (f?f)/(b — a) is a 
number satisfying (9). 

The line y = H meets the graph of f at some point in the interval since 
the graph must pass continuously from its highest point on y = M to 
its lowest point on y = m. Hence there is some point &é in the interval 
such that f(£) = H. That is, if f is continuous on (α 6), then 


(10) sEa<sE<bA {f= (ὁ — a)f(é). 


This theorem is called the law of the mean for integrals. Its proof need not 
depend on geometric imagery. It follows from a theorem about con- 
tinuous functions that we state here without proof. 


If f ts continuous in (a__b) and m and M are its minimum 
(11) and maximum there, then for any y in the interval (m_M) 
there is an x in (a_b) such that f(x) = y. 


Now, if we divide the members of (8) by (ὃ — a), we have 
1 
(12) ms eer <M. 
Hence the middle expression represents a number in (m_M), and by (11) 
there must be an zx (call it £) such that (10) holds. 
(i) Interpret (8) through (10) in terms of flow of water into a tank. 


Since definite integrals are limits of summations by (7-11-10) and 
(7-11-12), we would expect them to have some properties similar to those 
of summations. Indeed 


(13) fetaef t 
(14) im G+o=f[ +f g. 
(15) ΓΞ ΕΠ 


(j) Argue for (13) through (15) in terms of areas. (Κ) Argue for them in 
terms of flow of water into a tank. (1) From them prove (16). 


(16) fo-o=[r-fo. 
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7 + hy — FQ) 


7 + ἢ) 


4 (ἡ + h, 0) 


FIGURE 7-38 


If we accept theorems (10) and (15), we are in a position to prove 
the following theorem, from which the fundamental theorem can easily 
be derived. 


(17) D: [1 = f(a). 


This seems quite plausible in terms of our various interpretations, for sup- 
pose that f(x) is the rate of flow of water into a tank. Then f* f is the 
total flow into the tank in the interval of time from a to z, and the rate 
of change of total flow is the rate of flow. For simplicity of symbols, let 
F(x) be the integral from a to x of f. Let F(x) be interpreted as the area 
under the graph of f between the vertical lines through (a, 0) and (2, 0), 
as suggested in Fig. 7-38. Now F(a +h) = F(a) + A+B, as in- 
dicated in the figure. But A = hAf(x). Hence, by definition, DF(x) = 
lim [Af(x) + B]/h = lim [f(x) + B/h] = f(x) + lim (B/h), where the 
limits areas ἢ > 0. But B = (1/2)A[f(a + h) — f(x)], since it is prac- 
tically a triangle of base A and altitude f(a + h) — f(x). Hence B/h = 
[f(x + ἢ) — f(x)]/2 — 0 as ἃ — ὁ if f is continuous, and DF(x) = f(z). 


(m) Why is this not a proof of (17)? 


To prove (17), we let F(x) = f* f by definition and prove DF = ἢ 
(158) DF(x) = lim 


as follows. 
tim [fF — [ΠΛ {10 


| 
ay ime fe ae 
yon 


(21) = lim [hf(£)/h] for some £in (x 2x +h) (10) 


(20) = lim 


A—>0 


(22) =e f(x). 
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The final equality holds because « < ὃ < x+h, so that as h — 0, 
~— x. Then f(t) — f(x) because f is continuous. 

Now we can complete the proof of the fundamental theorem itself. 
Since DF = f, we have for any antiderivative ὉΠ, F = ΟΠ + ¢, 
where c is some constant function. [See 7-9-7.] Hence for every x in 
(α ὃ), F(x) = D7'f(x) +c. In particular, let τ = a. Then F(a) = 
D-1f(a) +c. But by (7), F(a) = 0. Hence c = D7'f(a), and for all x 
we have F(x) = D7 'f(x) — D7'f(a). Now we let x = ὃ to get F(b) = 
D~ f(b) — D~!f(a), which is just (1)! 


PROBLEMS 


The fundamental theorem opens up vast possibilities for finding areas and 
other quantities given by definite integrals. If we can find an antiderivative 
of f, we can immediately find any definite integral of which it is the integrand. 
In doing the calculations it is convenient to adopt the notation [Ὁ for 
F(b) — F(a). Then the fundamental theorem can be written briefly; 1.e., 


(28) [r= Dn. 
Thus we write 


(24) [Γ 13 -- iP 36 = 8/3 — 0/3 = 8/3. 


Sometimes, in fact most often, one sees such calculations written in terms of 
images using a dummy variable. For example, (24) becomes 


2.5 ἄγαν 
(28) Ϊ a dx = [810 = 8/3 — 0/3 = 8/3. 

0 
The calculation in terms of the function itself is easier and briefer, but the 
form in terms of images may remind the reader that the integral is the limit 


of a sum of terms of the form x? Az. It may therefore be preferable in writing 
about applications. 


In Problems 1 through 20 evaluate the definite integrals by using the funda- 
mental theorem. Present your work in one of the forms (24) or (25). 


3 3 
1. f i. 2. | χ᾽ dz. 
1 1 
1 2 
3. [ 1" 4. [ 6 Gee en Ἂς 
3 | 1 
=8 2 4 4/2 1/3 1/4 
5. [ dx/x°. 6. [ (ao “tae * +a") de. 
—10 


7. [ v2 + 1 (22) de. 8. [ : (I+ 3)". 
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0 0 SS 
9. | (21 — Py, 10. " ΜΊ — x3 de. 
1 0 
11. [ sin. 12, [ cos. 
0 0 
1 eS ΟΡ οσ π[ 2 
13. J /1 — tdt. 14. 1 cos [27]. 
0 0 
0 2 
15. [ cos: sin”, 16. [ qa se: 
1 
1 1 
17. [ Ta ay, 18. " ( -- “)} az. 
0 
2 2 b 9 3.2 
19. f I cos [1]. ae fi Ce Se 
1 a 


21. The rate of production of a firm from time ¢; to time ἐξ is given by 
mt + b. Find total production during the period. 

22. Find the area under —J?-+ 1+ 6 between the two points where it 
crosses the x-axis. 

23. Find f*, 7° and interpret your result. 

24, Show that (6), (7), and (13) through (16) follow from (2). (Note that 
this is the opposite of the logical development we used.) 
*25. Find the H and ξ for which (9) and (10) hold for the integrals in Problems 
1 and 13. 
*26. Argue from Fig. 7-39 that if the graphs of f and g meet at (a, f(a)) and 
(b, f(b)), and f(x) — g(x) > 0 in (α δ), then the area between the curves is 
given by the integral of f — g from a to ὃ. 
* 27. Find the area bounded by y = x? andy = z. 
* 28. Find the area bounded by y = 2? and y = ἡ". 
*29. Find the area bounded by y = x? and y = —a*-+ 5. 
*30. Argue from a figure that the area bounded by the graphs of f and g and 
the lines z = a and z = ὃ is given by the integral over the interval of |f — 9]. 
*31. Find the area between the graphs of y = x? and y = —2z?-++ 10 and 
the lines x = +1. 


(ὁ, 0) 
FiIGuRE 7-39 
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* 32. Water is flowing into a tank at the rate of δέ, where ¢ is the time, and is 
flowing out at the rate of —3t?. What is the change in the water in the tank 
from t = O0Otot = 1? from ἐ = 0 to# = 2? How much water is in the tank 
after 3 units of time if the original amount in the tank is 100? Graph the amount 
of water in the tank as a function of time from t = 0 tot = 3. 

*%33. Prove (13), (14), and (15) by using the definitions and theorems of 
Section 7-11. 

* 34. Argue for (26) through (29). [Law (27) is called the formula for integra- 
tion by parts.| | 


(26) fas fou 

(27) [1 Ὁ = 7.1 — [9 Oy. 

(28) af S-lol -- ΚΘ. f ll A @<E<d) 
(29) ela) - fe) SoM [F< fo. 


*% 35. When a force of magnitude F moves through a distance 8 in the direction 
of the force, the work done is defined as F's. If the magnitude of the force de- 
pends on x (the distance measured from an origin in the line of action of the 
force), then the work done in going from x = a to x = ὦ is defined as the 
integral from a to ὃ of F(x); that is, W = f° F(x) dx. Argue for the reasonable- 
ness of this definition. 

* 36. Suppose the force is given by sin z. Find the work done in moving from 
τ πίος = 0. 

*37. In a physics textbook the following integral appears: H = [ὦ 128 dt, 
where 77R = I?Rsin?wt and T is the period of sin2wt. Show that H = 
(1/2)I2RT. [Here R, In, and w are independent of t. Suggestion: To find an 
indefinite integral of sin?wt, use the fact that sin?wt = (1/2)(1 — cos 2wt).] 


ANSWERS TO EXERCISES 


(b) Using another antiderivative would add the same constant to F(b) and 
F(a). ὦ Sef = --  -» Sef = 0). (g) Interchanging the limits multi- 
plies the integral by —1. (ἢ) The area ofa line segment is zero. (i) (8): Total 
flow is between what it would be if rate of flow had been at the minimum and 
at the maximum. (9): There is some constant rate of flow at which total flow 
would have been the same; this rate is the average rate. [Note: The number H 
defined by (9) is called the average value of f(x) in (a__6).] (10): At some time 
the rate of flow was equal to the average rate of flow. (7) (13): Multiplying 
each ordinate by c multiples the area of each rectangle by c. (14): The area 
under f + g is the sum of the areas under f and g. (15): The area below f from a 
to ὃ equals the sum of the areas obtained by breaking up the region at some 
intermediate point. (The law holds, however, whether or not a <c < δ.) 
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(k) Multiplying the rate of flow by a number multiplies the total flow by that 
number. The total flow due to two flows is the sum of the total flows due to 
them separately. The total flow over a period of time is the sum of the flows 
over any two intervals into which the time interval is divided. (1) (13)(c:—1) 
and (14. (τω) It depended on a figure in which f(x) > Ὁ and on intuitively 
justified approximations. 


ANSWERS TO PROBLEMS 


1.20. 3.0. 5. 1/40. 7. Note that D,(x?-+ 1)8/? = (38/2)(x? + 1)1/?2x 
by the chain rule. Hence an antiderivative is (2/3)(x? + 1)?7. 9. —1/3. 
11. Since Dcos = —sin, an antiderivative is —cos; 2. 13. 2/3. 15. Dsin? = 
3 sin2cos. Hence an antiderivative is (1/3) sin?. 17. Note that D(I?-+ 1)?” = 
(3/2)(12 + 1)1/22T. 19. Ὁ sin [12] = cos [12]121. 21. (m/2)(t3 — t{) + b(te — 1). 
22, 125/6. 23. 0. The area below just balances the area above because the 
function is odd. 


7-13 Exponential and logarithm. The function defined by y = 6* is 
called the exponential function to the base b. We call it exp, or δὶ. We 
have defined b* for all rational x (Section 6-8). To avoid imaginaries, 
we assume b > 0. The range of x in b” was extended to all reals by (6-9-7), 
which we repeat here. 


(1) Def. δ" = lub fy|rre Ra Nr <xAy=0d*}. 


This defines δῖ for irrational x in terms of δ for rational r. It means, 
in effect, that we approximate δῦ for irrational x by means of δ᾽ for ra 
rational approximation of x. Now, the laws of exponents can be proved 
to hold for any real exponents, though we omit the proof. The domain of 
exp, is Re. In Fig. 7-40 we show 2’. 


FiGuRE 7-40 
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(a) What is 17? (Ὁ) What is the range of δ) (ὦ Graph 10] and 2? 
together. (d) Why does δ pass through (0, 1) for any 6? 


It appears plausible from the graph that δ᾽ increases with x when 
6 > 1, and that it is continuous. This means that any horizontal line 
above the x-axis crosses it once and only once, i.e., that y = δῖ has just 
one solution for x for each positive y. From this it follows that the con- 
verse of b’, defined by + = ὃ", is also a function. We call this inverse the 
logarithm to the base b. It is defined by 


(2) Def. logy = (67). 


The graph of logs is sketched in Fig. 7-40. Sometimes b/ is called the 
antilogarithm to the base ὃ. 


(6) Graph exp3 and logs together. (f) What about exp, forb < 1? (g) What 
are Rge (logs) and Dom (logs)? 


From (2) we have immediately 
(3) (y = logex) <> (ὦ = bY = expzy). 
Then from (5-10-8) and (5-10-9), 


pose 


(4) exps(logey) = = Ψ, 

(5) log,(b”) = loge(expsr) = 7. 
From the laws of exponents, 

(6) Ssh ot 

(7) : (0. = bY. 

From these we have 

(8) logy(u-v) = logyu + logge, 

(9) log,(u’) = v- logsu. 

To prove (8) we write 

(10) loge(u-v) = Ἰορο(δ᾽ πο". b°*”) (4) 

(11) = logs(bioe Fim) (6) 


(12) 


log pu -ἴΚ- logy (5) . 
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(h) Prove (9) similarly, and derive (13) through (16). 


(13) log,(1/v) = —log»., 

(14) log,(u/v) = logyu — logo, 
(15) logya = 1/logab, 

(16) Ἰοραῖ = ἰορρα - logza. 


The laws (8) through (16) make logarithms very valuable computa- 
tional aids. If we have a way to find logarithms and antilogarithms of 
numbers, we can reduce multiplications to additions and finding powers to 
multiplications. The most convenient base for such purposes 15 10. We 
write log x for logi9z. Below is a table of log x. 


x 10% 0.001 0.01 01 1 10 100 1000 10,000 


(17) 
log, Y —3 -—-2 —-1 0 1 2 3 4 


(i) Justify the entries in (17). (j) Argue that 2 < logi9157.3 < 3, 
—1 < Ἰοριοῦ < 1. (Κ) loge4 = ? (1) loga2 = ? (m) logel6 = ? 
(n) logV10 = ? (0) logo.50.25 = ? (p) logaz = —1/2, 4 = ? (q) Verify 
that (8) through (16) hold in (17). 


Table (17) serves to place the logarithm of any number to the base ten 
in an interval of unit length. For example, logi9150 - 2 + d, where 
0 < d < 1. The decimal part d can be found in tables and is called the 
mantissa. The integral part is called the characteristic. Logarithms to the 
base 10 are called common. Since 


(18) log (10 - x) = log 10 + logz = 1 + logg, 


moving the decimal point in a decimal changes the characteristic of its 
common logarithm but has no effect on the mantissa. Hence, in looking 
up mantissas we ignore decimal points. Then the characteristic is deter- 
mined by a knowledge of table (17), by noting that it is the exponent on 
10 when the number is written in scientific notation (see Section 6-11), 
or else by the rule of Problem 4 below. 

Table III in the Appendix gives three-digit approximations of the man- 
tissas of common logarithms of two-digit numbers, the first digit of the 
number appearing at the left and the second at the top. Thus log2 = 
0.301, log 5 = 0.699, log 8.6 = 0.934. The characteristics of all these 
logarithms are 0, since the numbers 2, 5, and 8.6 all lie between 1 and 10. 
However, log 86 = 1.934, log 860 = 2.934, and log 8600 = 3.934, as 
can be seen from (17). On the other hand, log 0.86 = —1 +- 0.934, 
log 0.086 = --2 - 0.934, log 0.0086 = —3+ 0.934. We could not 
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(520, 716) 


(510, 708) (516, 708) 


FIGURE 7-41 


(520, 708) 


write —1.934 for —1 -+ 0.934. To avoid negative mantissas, which are 
not tabulated, we write log 0.86 = 9.934 — 10, log 0.086 = 8.934 — 10, 
and so on. This amounts to adding 10 to the characteristic, then sub- 
tracting 10 from the logarithm. 


Find the common logarithm of: (1) 1, (8) 1.2, (t) 530, (υ) 5300, 
(v) 0.0053, (w) 0.00053. (x) Make a large-scale graph of log in the interval 
(( 10) by using Table III. . 


As is evident from the graph of log and from Table III, the curve is 
very closely approximated by straight line segments. Thus in Table III 
the difference between successive values of log x remains practically the 
same over a number of values of z, i.e., the slope of the curve is almost 
constant. We can use this fact to estimate logarithms by linear interpo- 
lation from Table III. For example, to find log 5.16 we note that log 5.1 = 
0.708 and log 5.2 = 0.716. In Fig. 7-41 we show the points (5.1, 0.708) 
and (5.2, 0.716) on the curve, and the secant line. The point (5.16, y) on 
the secant can easily be found, because the equality of the slopes of the 
segments on the secant implies that the corresponding differences of « and 
log x are proportional. Hence (5.16 — 5.1)/(5.2 — 5.1) = (y — 0.708)/ 
(0.716 — 0.708) and y = 0.708 + 0.005 = 0.713. The calculation is 
conveniently arranged as follows. 


ἥ 10gx 
510 | 708 
(19) 6 n | 
10 516 y {8 
520 716 


Then h/8 = 6/10, ἢ = 48/10 = 5, y = 708 - 5 = 713. Note that 
decimal points are ignored and all numbers are written with three digits. 
Calculations are rounded off to the nearest third digit. (Compare the 
discussion in Section 4-11.) 

Three-digit logarithms give roughly three-digit accuracy in computing. 
For more accurate work, tables of 4, 5, or even 15 or 20 digits are available. 
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Table III(a) in the Appendix is an expansion of Table III for 1.0 < x < 
2.0, where the interpolation from Table III is less accurate. In deciding 
how many digits to keep in calculations we follow the rules given in 
Sections 6-11 and 6-12. 


Find mantissas for the following numbers by interpolation in Table III: 
(y) 517, (2) 515. Find the mantissas for the following numbers by inter- 
polating in Table ΠῚ and compare with the results given by Table IIIa): 
(aa) 191, (bb) 155. 


To do computations with logarithms one must first decide on the pro- 
cedure and make a form. For example, to find A = (51.6) - (0.238) we 
decide to use (8)(u:51.6, v:0.238), then write down the form 


log 516. = ys 
-++ log 0.238 


| 


(20) 
log A — ee 


A =... 


Then we find the logs, insert them, perform the addition to find log A, 
then go into Table III “backward” (using it as a table of antilog x) to 
find A. From (19) and (17) log 51.6 = 1.713. Similarly, log 0.238 = 
9.376 — 10. Adding, we find log A = 1.089. Ignoring the characteristic, 
we look in Table III for 089. The table for interpolating in reverse is 


log x Ap 
079 | 120 
(21) 10 hy} 
35 | 089 δὲ 10 
114 130 


Hence h/10 = 10/35, h = 100/35 = 3, and x = 123. Now we note 
that the characteristic of log A is 1. Hence A = 12.3. Interpolation in 
Table III(a) yields practically the same result. 


Find the antilogarithms of the following: (cc) 0.883, (dd) 1.155. Carry 
out the following computations, checking your work by arithmetic: (ee) 2(4.18), 
(ff) 4/3, (gg) 2%. Find the following, being sure to make a good form before 
beginning the calculation: (hh) (3.14)(78). 


PROBLEMS 
1. Graph (1/2)/. 
ΧΩ. Prove as follows that b* increases with z when ὃ > 1. Note that we 
wish to prove that δ᾽ Τὰ» b* for h > 0, that is, 6*(b* — 1) > 0. Hence it is 
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sufficient to prove δὲ > 1 for h > Ὁ. Why? To prove δὲ.» 1, prove it first 
for hE N by induction. Then prove thatO “ ς “1 - οἴ <1 fornENn. 
Now prove that for n € N and ὃ > 1, 6!’ > 1, by assuming the contrary, 
that is, b!/" < 1, and showing that it contradicts 6 > 1. Combine these results 
to show that δὲ > 1 for h € Ra. Argue for the plausibility of the same result 
forh € Re. : 

3. Show that loggz = [y 3 δ᾽ = 1]. 

4. Justify the following rules: Count the number of decimal places from the 
space at the right of the first significant digit to the decimal point, counting 
positive to the right and negative to the left; the characteristic is this integer. 
To place the decimal point in an antilogarithm begin at the place to the right 
of the first significant digit and count a number of places equal to the charac- 
teristic. 


In Problems 5 through 14 find the logarithm of the given number. 


ὅ. 218. 6. 3.14. 7. 0.031. 
8. 215. 9. 0.00891. 10. 51.3 
11, 8.27 12. 0.0000194. 13. 0.101. 
14. 0.0777. 


15. Show that a negative number does not have a logarithm that is a real 
number. 
16. Why is interpolation less accurate when the number is near 1 in Table IIT? 


In Problems 17 through 24 find the antilogarithm of each number. 


17. 0.114. 18. 2.708. 19. 9.814-10. 
20. 8.211-10. 21. 0.329. 22. 5.128. 
23. 3.889-10. 24. 3.777. 


Use logarithms to compute the numbers in Problems 25 through 34. Make a 
form before starting computations, and approximate the answer by drastic 
rounding off. 


25. (3.14)(8.26)2. 26. ν 26. 

27, ὃν 1588. 28. (856)(35.2)—1(0.0154). 
29. 1/256. 30. (28.1) (252) /(27.35). 
31. (2/3)5. 32. (1.06)15, 

33. (0.99) 190, 34. (0.999) 100, 


ANSWERS TO EXERCISES 


(a) 1. (Ὁ) Positive reals. (ὁ) They cross at (0,1). The first is below at 
the left and above at the right. (ἃ) Since 9 = 1, (6) They should be mirror 
images in the graph of J since they are inverses. (f) The graph of expy is the 
mirror image in the y-axis of the graph of expc1/»), since δ = (1/b)—*. (g) Range 
is all reals, domain is positive reals. (8) For (9): loge(u’) = logs((b!oss")?) = 
log,(b7°8s") = v logy. (13) is (9)(uw,v:—1). (14) is (8)(@:1/v). (15) > 
(a = bl/logab) <> (gloeab = 6) <> (Ὁ = δ). (16) Similar to proof of (15). 
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(i) Refer to (3. (7) Since a < ¢c < b> loga < loge < logb. = (k) 2. 
(1) 1/2. (m) 4. (n) 1/2. (0) 2. (p) 1/2.  (q) Substitute for u and »v. 
(r) 0. (8) 0.079. (Ὁ) 2.724. (u) 83.724. (v) 7.724-10. = (w) 6.724-10. 
(x) Similar to are of loge in Fig. 7-40. (y) 0.714. (2) 0.712. (48) 0.281. 
(bb) 0.190. (cc) 7.64. (dd) 14.3. (66) Should agree when rounded to 
three significant digits. Same for (ff), (gg), (hh). 


ANSWERS TO PROBLEMS 


1. Mirror image in ἑ axis of 2. 3. (3). 5. 2.338. 7. 8.491-10. 9. 7.950-10. 
11. 0.918. 13. 9.004-10. 15. Since δ᾽ > 0 for b> Ὁ. 17. 1.3. 19. 0.651. 
21. 2.14. 23. 0.000000775, 25. Check by arithmetical calculation or use of 
tables. | 


*7-14 Growth and decay. From the definition of the derivative and 
the properties of log, 


(1) Dlog, x = lim [log (ὦ + h) — log x]/h 
0 


(2) — lim ᾳ /h) log (: zi h) 

h-0 v 
(3) = lim [((1/x)(x/h) log (1 + h/x)] 
(4) = (1/2) lim flog (1 + h/x)*""). 


(a) Justify each step. 


Evidently in order to find Ὁ log,z we must evaluate the limit as ἢ — 0 
of log,(1 + h/x)*/*. Since log, is everywhere continuous, we can do this 
provided we can find the limit of (1 + h/zx)?/". Note that as ἃ — 0, 
h/x > 0,1+h/x — 1, but 2/h — o. Hence as ἢ — 0, we have larger 
and larger powers of numbers that get nearer and nearer one. The limit 
is a certain irrational number, approximately equal to 2.72, which is 
designated by the letter e. Formally, 


(5) Def. | e = lim (1 + 1/a)’. 
aww 
We omit the proof that this limit exists. However, the student may ap- 
proximate it by expanding (1 + 1/a)* for large integral values. of a by 
the binomial theorem and observing that only a few of the terms in the 
expansion contribute significantly to the result. 
Letting a = +/h,a — ow ash — 0, and we have from (4) 


(6) D logst = (1/2) lim logs[(1 + 1/a)"] 
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(7) = (1/2) log,f[lim (1 - 1/a)"] 


(8) = (1 4) logve. 
(b) Justify (7) and (8). 


Now suppose we choose 6 as the base. Since log.e = I, 


(9) Dlog.z = 1/z. 


It is this result that makes e the most convenient base for scientific 
work. Logarithms to the base e are called natural logarithms. The loga- 
rithm to the base e is usually designated by In, defined by 


(10) Def. In = loge. 


Table IV in the Appendix gives In x in terms of x. By reading it back- 
ward we have a table of e”. 
From (9), 


(11) Din = I~’. 
It is interesting to observe that prior to this we had no function whose 
derivative was 1.1. Also, we were unable to find the antiderivative of 


1/I. Now we can find an antiderivative of any power function J”. 
Now, by applying (7-8-4) and recalling (7-13-2), we find 


(12) Dei = εἴἰ᾽.᾽έἌΗό (that is, De® = e”), 
(13) D exp, = exp, - (log.b) (that is, Db” = δῦ. log,b). 


We have found a function that is its own derivative! This means that 
we can solve an equation of the form Df(x) = f(x); that is, 


(14) (Df = f) — (cf = c-e!). 
More generally, 
(15) (Df =f-9) «» Acf=c-e&). 


This reduces to (14) by (g:1). To find (15) we might write the hypoth- 
esis in the form (Df)/f = g. But D[ln[f]] = (Df)/f by the chain rule. 
Hence this is equivalent to D[In [f]] = D[D~*g]. Then by (7-9-7), In (f) = 
D—-!g + C where C is some constant. By (7-13-38) this is equivalent to 
f = exp (D~1g + C) = [exp (C)]lexp (D~*g)] = c- exp (Ὁ 19). This ar- 
gument holds only for f such that f(x) > 0, since the domain of In is the 
positive reals. (See Problems 20 and 21.) 
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Imagine a population (human, animal, or botanical). It is natural to 
assume that the rate of growth is proportional to the size of the population 
y; that is, if x is the time and y = f(x) for some f, Df = k-f. This is on 
the assumption that the rate of reproduction per person and the death 
rate per person are fixed, so that k is the net rate of growth. Then from 
(15)(g:k), f = c-+exp (D-1k) = = c-exp(kI), that is, f(x) = ce** for 
some constant c. Letting the population be yp when x = 0, we have 
yo = ce°,¢ = Yo, and y = yoe** 


(16) (Df = Κ᾿ ἢ > [f(@) = 700) e*. 


This is the law of growth (or decay if k < 0) of a population with a con- 
stant rate of growth é per unit. 


(c) Suppose that the population rate of growth is 3%, that is, that k = 3/100. 
What will be the population after 3 years if it starts at 100,000,000? (d) Sup- 
pose that the population decreases at the rate of 3%. Answer the same question 
as that in Exercise (c). (6) Suppose that a body decays so as to lose 1/10 of 
its weight every 10 years. Find its weight in terms of time if its original weight 
was 5 pounds. (ἢ) What is the solution of Df(x) = kf(zx) if f(zo) = yo, that 
is, if we are given a value of f(x) corresponding to zo instead of to 0? 


PROBLEMS 


1. If interest is assumed to be compounded “instantaneously” at a rate 1, 
the total deposit y “grows” at a rate given by Dy = ty. Find the instantaneous 
compound interest law; i.e., find f so that y = f(/). 

2. At instantaneous rate of 5% what is the value 5 years from the present 


of $100? 


3. What amount must have been deposited 5 years ago to amount to $100 
now at 5% compounded continuously? 

4, Suppose that the rate of decay of a radioactive substance is —k, where 
k > 0, so that Dy = —ky, where y is the amount of the substance. Find the 
law of decay. 

5. The half-life of radioactive material is the time in which one half of the 
material radiates away, i.e., the x for which f(x) = f(0)/2. Determine this in 
terms of the & in Problem 4. Graph f for & = 1. 

6. The German psychologist Fechner (1801-87) assumed that for small 
increments AR and AS in the stimulus FR and the response S (yes, that is the way 
the letters are used by psychologists!), AS = cAR/R, i.e., the change in re- 
sponse is proportional to the ratio of the change in the stimulus to the stimulus. 
Writing this AS/AR = cR, letting AR — 0, and thinking of S as depending on 
R by some function given by S = f(R), we have DS = ¢/R. From this derive 
Fechner’s law, S = cln(R/Ro), where Ro, called the absolute threshold, is 
given by f(Ro) = 

7. When the electromotive force is cut off in a wire in which a current 7 is 
flowing, the rate of change of the current with time, Dj, is related to the re- 
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sistance FR and the inductance L by LDi-+ Ri = 0. Find the current in terms 
of time if 7 = 7 whent = 0. 

8. When the motor of a boat is cut, its acceleration D?s is related to its 
velocity Dis by D?s = —kD,s. Let Dis = ν, so that the equation becomes 
Dw = —kv, and solve to get v and finally s in terms of ¢ if s = so and v = v9 
when ¢ = 0. 

9. If a body is falling in a resisting fluid with velocity v, then Dw = A — Bu, 
where A and B are fixed. Show thatv = (A/B)(1 — e—**) satisfies the equation. 
*10. A more realistic assumption about rate of population growth in a limited 
environment is that it tends to decrease as the population approaches a maxi- 
mum Y determined by the environment. (Imagine the growth of a fly colony 
in a bottle!) A possible hypothesis is Dy = (1 — y/Y)ky or Df(z)/(1 — 
S(x)/Y)f(z) = k. Show that the left member is the derivative of —In(1 — Y/f(zx)) 
and so solve this equation in order to find the logistic law of population growth, 
y = Y(1+ ce—**)—!, where c = (Y — yo)/yo and yo = the population at 
the time x = 0. Sketch the curve for Y = 100,000,000, yo = 25,000,000, and 
find the points of inflection and asymptotes. Show that Dy — 0 asz — © and 
as r > —o, 

*11. Computations can be done with natural logarithms. However, (7-13-18) 
does not hold, and moving the decimal point changes the logarithm by the 
irrational number log.10. Table IV in the Appendix gives the natural loga- 
rithms of some numbers. By reading it backward we have a table of οἷ, The 
exponential function to the base e appears so frequently in scientific work that 
the “exponential function” and “exp” without indication of base mean expe. 
Note, in contrast, that “log” without base means logi9. We have In = exp* 
and log = (expio)*. Make a large drawing of In and exp. 
%12. Prove that the slope of In at (1, 0) is 1. 
#13. From Table [V find In 2, In 3, and In 6. 
*14. From Table IV verify that the slope of the secant joining two nearby 
points on In is approximately 1/z. 

15. Find D? In and explain why linear interpolation for finding log x is so 
accurate for large x but is less accurate for x near 1. 
%16. Find the area under 1/7 and between 1 and 3. 
*17. What is [Ὁ -11-ἰ] 0 
* 18. Show how (7-13-16) may be used to construct a table of natural loga- 
rithms if we have a table of common logarithms, or vice versa. 
*19. Show that 7 = (F/R)(1 — e—*®"£) is a solution of the differential equa- 
tion 1, 1 - Ri = EL. 
*% 20. Prove 


(17) Din |Z| = 1-, 
that is, Dz In [1] = 1/z. From this show that Ὁ 1η [7] = Df/f. 


ANSWERS TO EXERCISES 


(a) (1) by definition of derivative; (2) by properties of logarithms; (3) since 
1/h = (1/x)(x/h); (4) by (7-3-7). (Ὁ) Since log, is a continuous function; 
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by definition (5). (ὁ) y = 100,000,000e°-°3*; 100,000,000e°-°°, which can be 
evaluated by using Table IV backward. (Ἃἀ) 100,000,000e°-°9. Σ6ᾷ(6) Bet 
approximately. (f) yoe**—*o). 


ANSWERS TO PROBLEMS 


1. y = yoe*. 3. 100e~9-25, 5. The solution of 0.5 = e—**, that is, x = 
0.693/k. 7. ipe—*/t, 9, Differentiate and substitute. 


*7-15 Parametric representation. We have occasionally defined rela- 
tions by sentences of the form 


(1) | 722 -Ζ gf Ay=ht) AtEA. 


Usually the quantifier is omitted and the range A of the parameter ¢ is 
left implicit, but the pair of parametric equations z = g(t), y = hd), 
where A is the range of ¢, defines the relation 


(2) {(x, y)ldtz = gt) Ay = (ἢ Λ ἐξ A}. 


(a) Review (4-5-5), Problem 20 in Section 4-5, Problem 14 in Section 5-8, 
Exercise (6) and Problem 18 in Section 5-14, and Exercise (6) in Section 5-15. 


Usually the range of the parameter is some interval of the real numbers. 
For example, consider x = t?, y = ἐδ, t € Re. Solving the first equation 
for ἐ and substituting in the second, we find 


(y= 2? Vy = —27/*7) Ax> 0. 


The curve is sketched in Fig. 7-42. The equations resulting from elimi- 
nating the parameter are helpful, but the parametric equations are more 
convenient in finding points, since the square and cube are easier to cal- 
culate than the three-halves power. We can also get considerable informa- 
tion about the graph from the parametric equations directly. For example, 
we see immediately that the minimum value of x 15 zero, that the curve is 
symmetric with respect to the x-axis, and that y takes all real values. 
Note that the points on the curve are in the same order as the corre- 
sponding values of t, so that the parametric equations map the real line 
on to the curve in a way that preserves order. 

(b) Justify these statements. 

Under what conditions does (1) define a function? If g* is a function, 
then we can solve for ¢ = g*(x) in the first parametric equation and sub- 
stitute in the second to find y = h(g*(x)), which defines a function. If 


we restrict the range of x to Rge(g), this function is the same as that 
defined by (1), that is, in this case 


(3) Fix = gt) Ay=ht) Ate Alo [ν = AG*@)) A x © Ree(g)] 
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(4,8)pt = 2 


tl 
oe 
Ww 
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However, it 15 not necessary that g* be a function in order that (1) 
define a function. For example, consider x = t?, y = t?, t © Re. Elimi- 
nating ¢, we find y = x A x > O even though g* is not a function here. 

(ὁ) Show that déz =P Ay=PAtEGC Re y=2xAx2>0. (4) Find 
and sketch the function defined by x = cos*t, y = sin®t,tG Re. (e) Sketch 
the relation defined by x = cos*t, y = 2sint, ἐξ Re. Is it a function? 
(f) Sketch 2 = cost, y = 8sin*t, tG@ Re. Is it a function? (g) Sketch 
a = 2it],y = 1 —t. Is it a function? 

Often when the relation defined by parametric equations is not a func- 
tion it can conveniently be expressed as the union of two functions. In- 
deed if g* is the union of two or more functions, then (1) defines the union 
of the functions corresponding to each of these. For example, the relation 
of Fig. 7-42 is evidently 132) —J*/?. 1312 corresponds to the non- 
negative values of the parameter, and —J?/? to the nonpositive values, 
since (@=UAY=PAtECRYOl*@=_PAy=BALtD>O)V 
(@=t? Ay=t? At <0)]. The parametric representation is now 
expressed as the disjunction of two sentences in each of which the converse 
of the function giving 1 is also a function. 
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(h) Justify the above manipulations and statements. (i) Exhibit the re- 
lation of Exercise (6) as the union of two functions. (71) Exhibit the relation 
defined by x = sint, y = cost, A ἐξ (Q__27) as the union of two functions. 
Sketch. 

Whenever parametric equations define a function (or the union of func- 
tions) it may be of interest to find the derivative, if it exists. Since in this 
case we have y = h(g*(2)) and x = σ(ξ), the chain rule gives immediately 
Diy = D,yD,x or 


(4) Dey = Dyy/Dyx. 


From (4) we can find the slope of a curve defined by parametric equa- 
tions without eliminating the parameter. For example, applying (4) to 
c= t?, y = (5, we find y’ — 3t?/2t = (3/2)t. This formula gives us 
the slope of the graph of Fig. 7-42 at the point corresponding to ¢. When 
t > 0 it gives the values of the derivative of J*/?, and when ὁ < 0 it 
gives those of the derivative of —J*/*. The point (0, 0) lies on both func- 
tions. Strictly speaking there is no derivative there, since neither function 
is defined for x < 0. However, if we define a right-hand derivative by 
(7-4-1) with h restricted to positive values, (3/2)¢ gives the right-hand 
derivative for both functions. 


*(k) Formulate a definition of right-hand limit so that the right-hand deriva- 
tive can be defined as the right-hand limit of the fraction in (7-4-1). (The 
notation used is lim; _,¢+.) 

In each case find D,y in terms of the parameter: (1) x = 3t, y = 8; 
(mr =V1l—ty=VvV1itt; (n) ἃ =e y = Pet; (0) 2 = 2sinz, 
y = 3cos2. 

We have seen above how to deal with given parametric equations. 
We now wish to suggest how it is convenient to introduce parameters in 
order to find defining sentences of relations. Suppose we wish to discover 
the path that will be followed by an object near the earth’s surface (a 
rocket, for example) released with an initial velocity vg and subject only 
to the force of gravity. (We ignore such other factors as air resistance.) 
We place a coordinate system with origin at the point of release and y- 
axis through the earth’s center, as suggested in Fig. 7-48. Let vo = 
(V9, &o)p = (0 CoS a, Vg Sina) and Vv = (vz,vy), as indicated on the 
figure. It is a fact of physics that the components v, and v, of the velocity 
vector v at any time are independent of each other. Since there are by 
assumption no forces in the x-direction, we have vz = Dit = v9 COS ag 
from which x = (vg cos ag)t. In the y-direction, we have a freely falling 
body with initial velocity vp sin ag and acceleration —g. Solving this 
differential equation by the methods of Section 7-9, we find 


y = (vo sin αρ)ὲ — (1/2)g¢?. 
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Vo SIN ag 


Ug COS a 


FIGURE 7-43 FIGuRE 7-44 


These are the equations of Problem 14 in Section 5-8, from which we can 
find out everything about the path. 


(p) An airplane is flying on a course due west at an air speed of 275 mi/hr. 
The wind is blowing due south at a speed of 55 mi/hr. Find parametric equations 
of the path in terms of time. 


PROBLEMS 


In Problems 1 through 10 graph the relation defined by the parametric equa- 
tions. Exhibit nonfunctional relations as unions of functions where convenient. 
Identify the parameter geometrically if possible. In graphing you may find it 
helpful to eliminate the parameter, but try to find out also as much as is con- 
venient from the parametric equations. For example, the intercepts may be 
found by solving g(t) = 0 and h(t) = 0, symmetry may be investigated, verti- 
cal and horizontal tangents may be found by solving g/(t) = 0 and h’(t) = 0, 
D.y may be determined from (4), and so on. Note that points found should 
be linked in order of the corresponding values of ἐ. 


l. x = 2cost,y = 3sint,0 < t < 2m. 
2.2 = 2sint,y = 3cost,ic Re. 
3.2 = 2+1,y = 1,2 € (-τ-ὸὸ 2). 
4,.%e=1—t,y = 3+ 5,2 € (—1_1). 
5.2 = 2, y = [2. 
6.2 = l/t,y = 1/0. 
*7. 2 = tsint, y = tcost. 
8. 2£=14+8y=1-8@. 
9. χε (1+) y= ( -- ὃ: 1. 
*10.% = 10 - e 4 y = βἰπέ. 


11. A bomb is dropped from an airplane flying 585 mi/hr. Find the path,. 
neglecting air resistance. 

12. Solve the same problem if the plane is diving at 700 mi/hr at an angle 
of 45° below the horizontal. 

13. A circle rolls along the x-axis. What is the path described by a fixed 
point on its circumference? (Suggestion: Let the parameter be @ in Fig. 7-44, 
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where the point P starts at the origin so that OB has the same length as the 
arc PB.) The resulting curve is called a cycloid. Sketch it. 

14. Find formulas giving the velocity vector of the point in Problem 13, 
assuming that the circle rolls with constant angular velocity. Prove that when 
the point is at the top of the circle it is moving twice as fast as the center. What 
happens when it is on the z-axis? 

* 15. Prove that the normal to the cycloid of Problem 18 always passes through 
the point of tangency B, and that the tangent passes through the other end of 
the diameter through B. 

* 16. A circle rolls on the inside of a circle of four times the radius. Show 
that the path of a point on the circumference is given by x?2/3 + y?/3 = 2/3, 
where the circle is given by x? + y? = a?, and the path goes through the point 
(a, 0). The curve is called a hypocyclord. 

*17. Find parametric equations of the path of a point on a circle of radius a 
that rolls on the outside of a circle of radius 6. It is called an epicycloid. 

% 18. Derive parametric equations for Problem 16 with radii a and ὃ. What 
happens when a = b/2? 

%* 19. Find parametric equations of the square whose vertices are (1, 1), (1, —1), 
(1, —1), (—I, 1). 

% 20. Find parametric equations of the triangle with vertices (x1, y1), (x2, y2), 
and (x3, y3). 

*21. Show that (2) issynonymous with { (x, y)|Jé(z, y) = (g(t), A(t) A ἐξ A} 
and {z|3tz = (g(t), (Ὁ) A ἐξ A}. How would you interpret { (g(t), A(é))| 
ἐξ A}? Define it. 

22. Find the equation of the tangent line tox = at/(a + t®),y = at?/(1 + #8) 
até = 1. 

*23. Graph 2? + y? — 38ry = 0 by introducing a parameter through the 
substitution y = ἐξ and so determining parametric equations of the locus. What 
is the interpretation of ¢? This curve is called the foltwm of Descartes. 

*% 24. Graph x = a(3 — #7)/(11+ #7), y = αἰ(8 — 27,7. + #7), with a > 0. 
Eliminate ¢ and find a single defining equation. The curve is called the trisectriz 
of Maclaurin. 


ANSWERS TO EXERICSES 


(b) Since ¢? > 0 and 0? = 0; for every point (¢?, 5) we have a point ((—#)2, 
(—t)3) = (t?, —#?); and ¢? takes all real values. (c) If there is a ¢ for which 
x = 2 and y = ἐ2, then certainly y = x and x > Ὁ. Conversely, if y = zx 
and x > 0, we can find at, namely Vz for which y = ἐ2 and zx = ??. 

(4) {@yly=l—zA0<2< 1}. (0) {@yl4¢4+y2? =4A0K< 
a< 1}. No. (ὃ {(z,y)|3a7+y =3A —1<2< 1}. Yes. (g) {(2,y)| 
g= 2\1—yl}. No. (δ) ἐξ Reot>dVi < Oand (2-5-4). 

Gi) {@yly = 20 —2)7A0S eS 1 U{@%y y = —A1—2)7A 
0o<2r<1}. φα --υδὺ -- αἰ -- 1:5. (k) In (7-3-1) omit the 
absolute value signs around x — a. (i) #2 (m) -( — ὃ12(. - )7!/. 
(n) —e**(t?+ 2). (0) —(3/2) tana. (p) x = —275t+ 20, y = —55t+ yo. 
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ANSWERS TO PROBLEMS 


1. Ellipse. 3. Half-line. 5. y = (x/2)?/8. 7. Spiral. 9. Hyperbola. 
11. Rocket problem with ag = 0, v9 = 585. 13. α = α(θ — sin 8), 
y = a(l — cos θ). 


* 7-16 Arc length. We now return to the problem of the length of arc 
discussed briefly in Section 7-1. Suppose that the curve is given by 
x= g(t), y= h(tt),a<t<b. We partition the interval (ab) into 
subintervals given by a = tg < ty < tg < ... « tyn-1 ἐμ = δ. The 
points on the curve corresponding to the points of division are (20, Yo), 
(αι, ψ1)», ὩΣ, (σὺ Yi); 455. (Ln, Yn), where i g(ts), Yi = h(t). Imagine 
these successive points of division Joined by straight line segments as in 
Fig. 7-2. The length of the 7th segment is ~/ (ας — x:~1)? + (yi — Yi-1)?, 


so that the length of the broken line approximating the curve is 
t= 

(1) DV es — χα) + Ye — ye-1)?. 
i=l 


It seems natural to define the length of the curve as the limit of this sum 
as the maximum of the (¢; — ¢;-1) approaches zero. 


(a) Argue that if the maximum of the (ἐ; — t—1) approaches zero, then 
n— ©, (0) Is it certain that the limit of (1) always exists? 


To put (1) in a form so that its limit may be calculated by evaluating a 
definite integral, we use the law of the mean (7-9-8) to write x; — 2,-1 = 


gts) — gi) = 9 (D(a — ἐξ -α)ὶ and yi — yin = WEY — ἐγ 1). 
Substituting in (1) and simplifying, we have 


(2) >) νιν] + [he I2ts — ἐν...) 
i=l 

This is almost in the form of (7-11-12) with f(x) :[g’(é)]}? + [A’@]?. It 
would be exactly in this form if ἐς = ¢’. 

(c) Why does it make no difference that we use tinstead of x here? (d) Why 
t.’ = ἢ (0) Why can we not assume ¢/’ = ἐς 

If ει = { in (2), its limit as the maximum of the ¢; — t;—1 approaches 
zero would be just 


(3) i ” [Da]? + [Day]? dt. 


It appears that as the lengths of the subintervals approach zero the dif- 
ference of ¢/ and ἐξ become negligible. Accordingly it seems reasonable to 
take (3) as the definition of the length of the arc of the curve given by the 
parametric equations. 
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(Ὁ What is one advantage of using parametric equations in this discussion? 


As an illustration of the use of (3) we find the familiar formula for the 
circumference of a circle. Letting the circle be defined by x = α 608 ἐ, 
y = asint,t © (0 27), we have 


(4) C= [ a αν sin2t + cost αἱ 
0 
(5) = af” at = αἰότ = 2ra. 
0 


(g) Find similarly a formula for the length of are of a circle with central 
angle @. 


If the curve happens to be the graph of a function, y = A(g*(x)) = 
f(x), then we have y; — yi-1 = f'(x))(a; — 2-1), and (1) becomes 


(6) VTE UGE (0: — 21-2), 
i=l 


from which we have 


(7) f " V/T + (Day)? dr, 


where a = g(a) and 8 = g(b). Note that the limits are the values of x 
corresponding to the endpoints of the arc. 


(h) Suppose the curve is the graph of x = g(h*(y)) = Fy). Write an 
integral for the arc length in terms of y. (i) Show that if s is the arc length 
measured along the curve given by x = g(t), y = h(t) from (g(a), h(a)) to 
(g(t), A(é)), then 


(8) D,s = [(Dyx)” + (Dy)7]"”. 


(j) Show that if s is the arc length measured along the graph of y = f(x) from 
(σι, f(x1)) to (z, F(x)), then 


(9) D.s = [1 + (D.y)]"”. 
(k) Find the length of the arc of the curve of Fig. 7-42 from (1, 1) to (4, 8) by 
using the parametric form. Check by using the equation y = «°/”. 

PROBLEMS 


In Problems 1 through 10 find an integral giving the arc length on the curve 
between the indicated points. Sketch. Evaluate the integral if you can. 


Ly = 22 =0 toe = 1. oy = 2" = Ο Τά ἃ = 1. 
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" (r + Ar, 6 + Δθ)» 


=(Ar)? + (r 6)? 


Figure 7-45 


y = |z|,2 = —ltog = 2. 

(65 + e-*)/2,2 = Otoz = 1. 
cosz,x = Otox = 2π. 

Inz,x = ltoz = 10. 

ssint, y = 3cost,t = Otot = 5. 
3sint, y = 2cosit,t = Otot = 7/2. 
χ2- 45 --- δ, κ = —5tozr = 1. 

10. 2 = tsint, y = ἐοοδέ, ἐ = Ο ἰοέ = 1. 


he aoe eee 
ee eeee 
Ι 


*11. Suppose a curve is given in polar coordinates parametrically by r = ρ(), 
6 = hit),a <t< b. With the aid of Fig. 7-45 argue for the plausibility of the 
following formula for the arc length. 


(10) [ ἤν (Dr)? + (DO)? dt. 


*12. Similarly show that if the curve is given by r = f(@), then the arc length 
from (41, f(@1)) to (82, f(@2)) is 


(11) [ ? 72 + (Der)? dé. 


* 13. Find a formula for the are length if the curve is given by @ = f(r). 

% 14. Use polar coordinates to find the circumference of a circle. 

*15. Find an integral giving the arc length of r = 26 between @ = 0 and 
θ = 1. 

*16. In terms of polar coordinates find the length of the straight line joining 
(0, 0) and (1, 1). 


ANSWERS TO EXERCISES 


(a) Since the number of intervals must be no less than (ὃ — a) divided by the 
maximum. (Ὁ) No. Not every curve has a length. (0) These variables are 
dummies. (d) In (7-11-12) f(x{) means the result of substituting for 2 in 
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f(x), and we cannot substitute two different things. (6) The values asserted to 
exist by the law of the mean may not be the same. (f) The curve may not 
represent a function. (4) (4-9-1). (h) The integral from h(a) to h(b) of 
the square root of 1 + (Dyz)?. (i) (3) and (7-12-17). ()) (7) and (7-12-17). 


ANSWERS TO PROBLEMS 


1. V2. 3. 3V2. 5. ΤΡ ΝῚ + sin22 dz. 7. 15. 9. Integral of (422 + 16x 
+ 17)!/2 from —65 to 1. | 


*7-17 The differential. The definite integral has a geometrical inter- 
pretation that is easily visualized—the net area between the graph and 
the z-axis. The derivative, on the other hand, is a little elusive. True, 
it may be conceived as the rate of change of the dependent variable with 
respect to the independent variable or as the slope of the line tangent to the 
graph. But there is no immediately evident geometric entity whose 
measure is the derivative. 

As pointed out in Section 7-4, the derivative is equal to the change that 
would occur in the dependent variable if the rate of change were constant 
and the independent variable changed by one unit. This is illustrated in 
Fig. 7-46. Since the slope of the tangent line T is equal to the derivative, 
we have AB/1 = Df(x) and AB = Df(x). If f were a linear function we 
would have Df(x) = f(z + 1) — f(x) = Af(x). But ordinarily Af(x) — 
Df(x) = BC = 0. The derivative is an approximation to the actual change 
in the function when the independent variable increases by one unit. 
Economists find this a convenient way to think of the derivative, because 
in economics one unit is often “relatively small” and BC + 0. However, 
1 may often be a “large” change, i.e., the error in thinking of Df(z) as the 
change in f(x) may be “large.” 

In Fig. 7-47 we indicate an unspecified increment Az in the independent 
variable. Again the slope of the tangent line is equal to the derivative, 
that is, AB/PA = Df(x), or AB = Df(z) Az, as indicated in the figure. 
Here Df(x) Av is an approximation to the change in f(x), Af(v). The error, 


FIGURE 7-46 
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(2, f(a) 


FigurE 7-47 


BC = Af(x) — Df(x) Ax depends on Az, but evidently approaches zero 
as Ax -- 0. More than that, the error approaches zero faster than Ax 
in the sense that BC/Ax = [Af(x)/Ax — Df(x)] — [Df(x) — Df(x)] = 0 
as Ax —- 0, This means that as Ay — 0, the error is small compared 
with Az. 

We adopt the symbol df(x) for Df(x) Av. In Fig. 7-48 we show the 
increment Az in x, the increment Af(x) in f(x) and df(x). Note that df(x) 
is the directed distance to the tangent line and is an approximation to 
Af(x). 

(a) Draw a figure in which Ar < 0,0 < Af(z) « ἀξία). Label it carefully 
to correspond with the figures in the book. (Ὁ) Do the same for Az > 0, 
Af(z) < df(x) < 0. (c) Do the same for Az > 0, df(x) < Af(z) < 0. 
(d) Do the same for Ar > 0, Af(x) < 0, df(x) > 0. 


It is obvious from the above discussion that “df(x)” is not a constant. 
Its value depends on x and Az as well as on f. It gives an approximation 
to Af(x) that depends on the function, the point on it, and the change in 
the independent variable. We shall sometimes use the notation “dy” in 
place of “df(x)” to suggest an approximation to the increment Ay, as in- 
dicated in Fig. 7-49. Then we write dy = Df(x) Ax. Evidently, for a 
given f, the variables here are “x,” “dy,” and “Av.” For fixed x and hence 
fixed Df(a), we have a direct variation in which the constant of propor- 
tionality is Df(x), that is, the direct variation Df(x)Z in which “dy” is the 
dependent variable and “Az” is the independent variable. 


(6) Find dy and Ay for y = 27, x = 2, Ax = 1, 0.1, and 0.01. (ὃ Do the 
same fory = Inz,z = 1,Axv = 2,1, 0.5, 0.1, 0.01. (g) Estimate (2.000001)? 
by using the fact that dy = Ay. (ἢ) Similarly estimate ln (2.0035). 


A very interesting result appears if we apply the chain rule to find 
df(g(t)). Suppose y = f(x) and x = g(t), so that y = f(g(t)). By the 
chain rule, D,y = Df(g(t))Dg(t). Hence dy = Df(g(t))Dg(Z) At. But we 
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(x, f(x) 


Figure 7-49 


have also dy = Df(x) Av and dx = Dg(t) At. Now for given values of ὁ 
and At, we have corresponding values of « and Av for which we should like 
the two formulas for dy to give the same result. But the first one becomes 
dy = Df(x) dx if we replace Dg(t) At by dz and g(t) by x. Now we have 
Κ΄ (α) dx = f’(x) Ax, which implies dz = Ax. This is a contradiction, 
since there is no reason to expect dx = g’(t) At to be the same as Av = 
g(t + At) — git). 

The above contradiction can be eliminated if we use “dx” in place of 
“Az.” Accordingly we write dy = Df(x) dv according to the definition 


(1) Def. [dy = Df(x) dx] = [Dry = Df()] 


We call dy and dx the differentials of y and x. By definition, to say that the 
differential of y is equal to a certain expression times the differential of x 
is the same as to say that the derivative of y with respect to z is given by 
this expression. For example, dy = 2.x dx means the same as D,y = 22. 
Definition (1) does not define “dy” or “dx.” These are simply variables. 


(i) dy = e*dx, Dzy = ? (ji) y =  ( — 2)?, dy = ὃ 


_Now, if we interpret dx as the increment Az, then dy is approximately 
the corresponding increment Ay = f(a -+ Ax) — f(z). This is the con- 
venient interpretation if we think of « as the independent variable. On 
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the other hand, if we think of 2 in its turn as depending on a third variable 
t, according to « = g(t), then we have by (1) dx = Dg(¢) dt and dy = 
Df(x) dx. We get dy = Df(g(t))Dg(é) dt either by replacing dx by Dg(é) dé 
in the second equation or by applying (1) directly to y = f(g(¢)) and using 
the chain rule. 

Clearly if we know the differential of a function we also know its deriva- 
tive, and conversely. It appears that nothing new has been added. But 
the notation has the advantage that it suggests the relation between the 
change in the independent variable and the change in the dependent vari- 
able. More precisely we may think of the values of the differential as ap- 
proximating the changes in the dependent variable corresponding to given 
increments in the independent variable. This advantage is important to 
scientists who find it more convenient to think in terms of small changes 
than in terms of rate of change. If dz τέ 0, [dy = Df(x) dx]  [dy/dz = 
Df(x)]. Suppose y is distance, x is time. Then the left member of the last 
equation suggests the change in distance (actually, dy is only approxi- 
mately the change in distance) divided by the change in time, i.e., the ve- 
locity. Also dy = Df(x) dx says that the distance traveled (approximately 
dy) equals the speed (Df(x)) times the time elapsed (dx). Moreover, as we 
have seen the error is small relative to dz, so that scientists can safely 
think in terms of small changes in order to get results that are precisely 
correct in terms of rates of change. 

Another reason for the use of the differential notation is that many 
results become “obvious” by mechanical rules of manipulation if we use 
differentials. For example, suppose we have the parametric equations 
z= g(t), y = h(t). We write dx = g(t) dt, dy = "' (ἢ dt, and by division 
dy/dx = h’(t)/g’(t), which is just (7-15-4). Of course, this is not proof 
of (7-15-4)! We have simply defined differentials so that they give the 
same result. Now that we have defined them, we may treat the “dy” and 
“dx” in “dy/dx” as separate variables to be manipulated like numbers. 
The way in which we have formulated the definition makes this possible 
without contradiction. 

(k) Clearly, in terms of differentials dy/dz = (dy/du)(du/dz). Why? Is 
this consistent with our previous results? (1) In Fig. 7-388 show that the area 
A equals the differential dF (x) of the total area F(z). 

As suggested by the last exercise, the differential notation has con- 
ceptual advantage in relation to the definite integral. If we compare the 
two expressions 


(2) Disc) ar, and [ζῴ) dz, 


we see that the second is reminiscent of the first. Indeed, in setting up a 
definite integral we may think in terms of small increments but write in 
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terms of differentials. For example, work W is defined as force times dis- 
tance F's when the force is constant. If the force varies, we may think as 
follows: the force is almost constant for a short distance ds, so the incre- 
ment dW of work in that short distance is about Fds, and the total work 
done is the sum of such increments. But dW = Fds implies dW/ds = F, 
W = fFds, and W2 = f® Fds, where W2 is the work done in going from 
s=atos= ὃ. Thus we think in terms of small changes and sums, but 
we write differentials and integrals. The notation is so designed that the 
results are correct, provided our intuition leads us to use differentials 
properly in a given problem. 


(m) Compare the discussion of (7-11-8). (ἃ) Discuss and show F(x) = 
Jf dF(x). (0) Suppose an organism’s weight W depends on time t. Interpret 
W = J§4@W in terms of small changes in W. Interpret W = (Ὁ W’dt in terms 
of small changes in time. 


For reasons indicated above and for others, one of which we indicate 
in the next section, the differential notation is very widely used. We did 
not introduce it sooner because it is a notational device that distracts 
from the basic ideas of calculus and may lead to serious misunderstanding. 
Indeed, it is largely a device for doing calculus manipulations without 
thinking in terms of the essential underlying idea of limits. It was invented 
by Leibniz (1646-1716) at a time when the limit concept and: the funda- 
mental ideas of calculus were not understood and one had to depend upon 
intuition to avoid errors. Accordingly, the differential notation was a very 
helpful device and still is useful. The person who continues to use calculus 
should use the derivative or differential notation according to convenience. 
Thus he may write d cos x = —sin x dx or Dcosx = —sinz (or Deos = 
—sin) according to convenience. 


(p) What is wrong with d cos x = —sinz ordcos = —sin? 


On the other hand, the student should be aware that the manipulative 
advantages of the differential are very special. It does not follow that this 
notation can be extended with equal advantage. For example, the differ- 
ential notation d?y/dz? is often used for D2y. Suppose we define a second 
differential by d?y = D2y(dx)*. Now if y = f(x), x = g(t), and y = 
f(g), then by the chain rule Dy = f’(g(é))9’(), O?y = f’(g))9"(® + 
f(g)’ ))?, and ἀδν = [(δ)σ΄ (δ (dt)? + 7’ (δ) (Ὠ)" αἢ". But 
directly from the definition, d?y = f’’(x)(dx)? = f’(x)(g’(b dt)? = 
f’' (2) (9’(t))?(dt)*. This contradiction suggests that the differential nota- 
tion does not apply to higher derivatives with any advantage. Though 
we may safely think of dy/dx as the ratio of two differentials, we cannot 
safely think of d?y/dx? as the ratio of d?z and dx”, since with this latter 
interpretation our results depend on what variables we use! Instead, 
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d*y/dx? must be thought of as “. πα that is, as the result of applying 
x dx 


" twice. The D2y (or y’’) notation is much more convenient. 
x 

(q) Did we solve the problem posed in the first paragraph of the section by 
finding a visualization for the derivative? (r) What would be the objection 
to defining the second differential as the differential of the first differential? 
* (s) How would it work to define the second differential as the differential of 
the first derivative? (t) Carry through the differentiations in the previous 
paragraph. 


PROBLEMS 


. When does dy = Dzy? 

. Show that if f is a constant function, df = 0. 
. For what function does dy = dx? 

. Show that d(uv) = udv + vdu. 

5. Use the result in Problem 4 to show that the relative error in a product is 
approximately the sum of the relative errors in the factors. (Note that if we 
think of u and v as the true values and du and dv as the errors, απ τ is the relative 
error.) | 

6. Show that d(1/y) = —dy/y?. 

7. Use differentials to estimate (8.993)?. 

8. Estimate (5.0003)°. 

9, Estimate sin 0.45 by differentials and compare with the estimate obtained 
by interpolation in Table IT. 

10. Why is it reasonable from an intuitive point of view that we should have 
dy = Ὁ at a maximum or minimum if we think of dy as the change in y when x 
changes slightly? 


m wh καὶ 


In Problems 11 through 16 find dy. 


11. y = sin x?. 12. y = εττῦ 2 
13. y = In (sin 2). 14. 27+ y? = 4. 
15. 2 = cosy + e%™, 16. y = (a — «Ξ2)ὅ — x7 In A(z). 


17. In terms of differentials it appears obvious that D,z = 1/Dzy since 
dy/dx = 1/(dz/dy). Why is this justified by (7-8-5)? 

18. Often the simplest differential equations are written in terms of differen- 
tials instead of derivatives. Indeed, this is the origin of the name. Solve the dif- 
ferential equation dy/y = dx/z. 

*19. The derivative was at one time called the “differential coefficient.” 
Explain. 

* 20. Before the development of a logically satisfactory theory of calculus, the 
derivative was conceived of as the ratio of two very small changes (differentials) 
that were smaller than any finite quantities yet not actually zero. Criticize this 
formulation and explain why it does nevertheless serve for practical purposes 
in elementary calculus. 


-_ 
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21. Show that the relative error in a quotient is approximately the difference 
of the relative errors. 

22. Show that the relative error of an approximation is cut approximately in 
half when we take the square root. 
% 23. Show that the relative error in the nth power is approximately n times the 
relative error in the base. 
% 24. Show that the relative error in Inz equals approximately 1/In 2 times 
the relative error in the number z. 

25. Rework some of the Problems in Section 7-15, using differentials. 
* 26. Prove that 


(3) (ds)? = (dx)? + (dy)?, 


where ds is the differential of are length. Draw a sketch showing the geometrical 
interpretation of ds. 

* 27. Show how (7-16-7) can easily be found from (7-16-3) by using 
differentials. 

*% 28. Show that in polar coordinates (ds)* = (dr)* + r7(d6)?.. Draw a sketch 
showing ds in this case. 

* 29. When a string is stretched an amount z, the force required is kx, where k 
is the force constant. Find dW, the differential of the work done in moving a 
distance dz, and so set up and evaluate an integral for the work done in stretch- 
ing a spring from z = Ὁ tox = ὃ. 

*30. Show, using differentials, that a = vD,v, where v = Dz anda = D?z. 

* 31. In what sense may the equation dy = γ΄ ἀξ be interpreted as the equation 
of the tangent line? 

%* 32. By definition the work done by a constant force F exerted through a dis- 
tance x in the direction of the force is Fz. The average power P is defined as the 
work divided by the time, Fx/t. In a textbook on physics we find, “If the time 
interval is made extremely short” then this last formula becomes Fdzx/dt, so 
that the instantaneous power P is Fv, where v is the velocity. Explain this. 
Derive P = Fv, by using derivatives or differentials, from the definition 
P = dW/adt. (Incidently, in the same book, this definition is Justified by saying, 
“Tf the rate of doing work is not uniform, the power at any instant is the ratio 
of the work done to the time interval, when both are extremely small.”) 

* 33. Pressure on a surface is defined as the force on the surface divided by the 
area. However if the force is not uniformly distributed, this merely gives aver- 
age pressure. In Sears and Zemansky’s University Physics we find, “We ... 
define the pressure at any point as the ratio of the normal force dF exerted on a 
small area dA including the point, to the area dA.” Explain. 

* 34. For further examples of reasoning using differentials, see in Sears and 
Zemansky’s University Physics the discussions of Pascal’s Law, forces against a 
dam, specific heat, thermal conductivity, the differential form of the first law 
of thermodynamics, electric intensity, electrical potential energy, etc. Note 
that in every case the reasoning is in terms of “small” or “infinitesimal” changes 
but that the result is a derivative or integral expressed in terms of differentials. 
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ANSWERS TO EXERCISES 


(a) through (d) There are still other possibilities! (6) dy = 4, 0.4, 0.04; 
Ay = 5, 0.41, 0.0401. (f) dy = 2, 1, 0.5, 0.1, 0.01; Ay = 1.099, 0.693, 
0.405, 0.095, interpolation is not accurate for In (1.01) in Table IV, but a more 
accurate table yields 0.00995. (g) y+ Ay = y+ dy = f(a) + df(z), with 
f(z) = x?,x = 2, ἀχ = 0.000001; 4.000004. (Check error by squaring 2.000001 !) 

(h) f(z) = Ina, dx = 0.0035, x = 2; y + dy = 0.693 + 0.00175 = 0.695. 
(i) ο΄. (Ὁ —2(1 — x) dz.  (k) Rule for multiplying fractions. Yes, it is 
just the chain rule! (1) dF(x) = f(x) daz, since F’ = f, and we interpret dz 
as ΔΖ here. (n) It is the same as F(x) = fF’(x) dz, which is true by defini- 
tion. It suggests that F(x) is the sum of the changes in F(z). 

(o) Π at any time is the sum of the increments to the weight from the begin- 
ning to the time. An increment is the rate of change of W times the time in- 
terval. This amounts to thinking of a continuous process in terms of small 
finite changes. (p) It is a mixture of the two notations. A differential always 
ends with the differential of some variable; Ὁ # d!  (q) We indicated two, of 
which the first was the special case of the second for dx = 1. (r) The first 
differential is a function whose domain consists of pairs of real numbers, so this 
would not make sense. 


ANSWERS TO PROBLEMS 


1. When dx = 1. 3. 1. 5. Divide through by w. 7. dx = —0.007; y + 
dy = 80.874. 9. 0.43, the same for both methods to two significant figures. 
11. 2x cos χ dx. 13. cotadz. 15. dr = —sin y dy + e9”g'(x) dx and solve 
for dy. 17. Here we are not interchanging variables, so (7-8-5) becomes Df*(y) = 
D,x« = 1/Df(f*(y)) = 1/Df(z), since f*(y) = x and y = f(x). 21. Find differ- 
ential of quotient and divide by the quotient. 


*7-18 Manipulation of integrals. In Section 7-9 we indicated that in 
practice integrations are performed by using tables of integrals. In this 
section we discuss some of the manipulations that are involved. 

In any table of integrals we find the formula 


dx 
(1) aS = Aresin ἃ a ee 


Of course, this expression is by definition synonymous with (2) and (3). 
(2) Ὀς Aresinz = (1 — ὡ Ὁ) "2, 
(3) ἃ Arcsin x = (1 — Ἱὔὗ "2 de. 


Forms (2) and (3) have the advantage of being identities to which we are 
sure that we may apply the Rule of Substitution, whereas the left member 
of (1) stands for any one of many formulas and the equality means that 
any antiderivative of (1 — x?)—'/? is Arcsin x plus some constant. 
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(a) Why [1| < lin (1)? (b) Write (2) using the identity function J, with- 
- out using the independent variable. %*(c) Τί you did not do Problem 11 in 
Section 7-8, derive (2) by writing z = sin ἡ, D,r = D,sin y, 1 = (cos y)Dzy, 
and observing that cos y = (1 — sin?y)!/*. (4) Why do we take the positive 
square root in this last sentence? | 


Suppose we wish to find the antiderivative of (1 — 412) 15, that is, 


(4) [ τ ξδ τς ἐν 
J Μ -- 422 
Comparing it with (1) we might be tempted to complete (4) by sub- 
stituting (x:2x) in the integrand in (1) to get the alleged indefinite integral 
Aresin 2x. But D Arcsin 2x = 2(1 — 4.52) 12 by (2) and the chain rule, 
so that this result is wrong. Of course, we see how to correct it by a factor 
of 2 to get the correct integral (1/2) Arcsin 2x. However, the fact that our 
substitution gave a wrong result indicates that either it or our notation Is 
faulty. 

The error will become clear if we substitute (7:2) in (2) and (3) to get 


(2) De, Aresin 2x = (1 — 42?)71/?, 


(3’) ἃ Arcsin 2x = (1 -- 412) 112 d(Qx) = 2(1 — 427)71/?. 


The correct result can be found from (2’) by noting that D,Aresin 2x = 
D.,Aresin 220,22, or from (3’) by dividing both members by 2. This sug- 
gests that we would have obtained the correct answer had we substituted 
in (1) (v:2x) throughout, including in the differential. In short, we would 
not have gone astray here if we had followed the Rule of Substitution 
mechanically. 

This may not seem surprising, since we expect that a correct application 
of the Rule of Substitution would not lead to error. However, the ex- 
pectation is justified only if our notation is well conceived. Suppose, for 
example, that we write (2) DArcsina = (1 — x7)~'/? without explicit 
indication of the variable of differentiation. Then substitution throughout 
here would lead to Ὁ Aresin 2x = (1 — x”)—'/?, which is incorrect if the 
differentiation is supposed to be with respect to x. We see that the Rule 
of Substitution in integrals may lead to error unless the differential nota- 
tion is used or the variable of integration is indicated explicitly and sub- 
jected to substitution throughout. 


(e) Is the same caution applicable to derivative formulas? 


The following theorem indicates the way in which substitution can be 
used safely in integrals. 


(6) [[1 de = F@)] o[ [Τω)σ dx = FO) |. 
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The proof is immediate if we note that the two members are synonymous 
with F’(x) = f(x) and D,F(g(x)) = f(g(x))g’(x). Since by the chain rule 
DF (g(v)) = F’(g(x))g’(x), the first member implies the second by re- 
placement. Also the second can be true only if F’ = f. But since dg(x) = 
g'(x) dx, (5) tells us that we can substitute in an indefinite integral formula 
for the variable of integration, provided we do it throughout, including the 
differential. 

Of course, we are not obliged to use the differential notation. We can 
write (5) in the form 


(5) ({f=F) (flay = Fla). 


Then we can get new integration formulas by applying (5’) with ap- 
propriate choice of g. Since f{Z] = f and F[J] = F, the symbolic manipu- 
lations will be similar to those using the differential notation, though 
somewhat more concise. However, since the differential notation 15 almost 
universal, we shall use it below. The reader may find it instructive to 
repeat the work, using the names of functions instead of names of images 
and differentials. 

In applying (5) it is customary to use a new variable in the substitution 
(which amounts to expressing the integrand in parametric form), carry 
through the integration, and then resubstitute to express in terms of the 
original variable. Thus we would compute (4) as follows. 


d(t/2) 
= {| —— “5 v= t/2 
- ae — 4.2 =| Fae — 4(t/2)2 eae 
(7) = (1/2) f a sieeld G23 a2 
(8) = (1/2) Aresin ¢ (1)(z2) 
(9) = (1/2) Aresin 2x (ee D0): 


We chose the transformation « = t/2 so that we would get an integral 
identical with (1) except for the variable used. We thus transformed the 
integral to make it like the known formula, rather than the reverse. This 
is perfectly justified by the logical equivalence in (5). Also, (5) justifies 
the equation (6) because by (5) the left member equals some F(x) if and 
only if the right member equals F(t/2), and these are 6 identical for all x 
and tif « = ¢/2. 

As a more complicated example, consider the following manipulations, 
where we choose x = sin ¢ because it promises the elimination of a square 
root. 
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(10) fa — 7)? dy = fa — gin?t)!/? cos t dt (4 = sind) 
(11) = [cos*t dt 

_ f¢ 1+ cos 2t BA atin rice 
(12) = (a (4-12-25) 
(13) = (:) +- (1) sin 21 (check by differentiating) | 
(14) - (4) +(3)s i cos t 

=\5 5) sin t cos 

| 1 ἱ 1 ------ς 

(15) = (2) Aresin x + (2) rV 1 — 2x? 


(since + = sin 2). 
(f) Justify (14). 


In these manipulations there are certainly some dubious steps. ΕῸΓ 
example, if = sin ¢, it does not follow that (1 — x”)? = cost. The 
most we can say is that cost = +(1 — 2x7)"/?. Nor does it follow that 
i = Arcsin x. However, such matters are not important 7f we check the 
final result by differentiation. The manipulations are to find an integral; 
only the differentiation proves that the discovery is correct. In the above 
case we have 


(16)  D(1/2) Aresin « + (2). — 2?)1? 
(1/2)(1 — 2?)7* 

+ (1/2)(1 — 2”)? 

+ (1/4)2(1 — 2?)~'/?(—22) 
(17) ss Ci eg 


| 


(g) Simplify the expression in (16) to find (17). (h) What would happen 
if we had chosen the negative square root? (i) Does this show that our final 
result holds only for certain values of x? 

Evaluate the following integrals, using the form (6) through (9) and (10) 
through (15): (ἡ [( — 922)~!/2dz, = (k) (( -- 4a7)!/4dz. 

(1) Repeat (10) through (15), but using the substitution 1 = cost. 


PROBLEMS 


#3 


1. To the question, “What is an indefinite integral of cos x?” a student gave 
[Ὁ cos x dz as an answer. Is this correct? Do you think this is what the professor 
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wanted? Can you formulate the question more precisely so that the student 
could not dodge the issue so easily? 


2. Show that 
(18) i cf = ec [ f, 
(19) [στὸ = {τὸ[ν 


and explain precisely what the laws mean. 


In Problems 3 through 10, make the indicated substitution, find an anti- 
derivative in terms of the new variable, and then express in terms of the original 
variable according to the pattern in the section. Make a list of integral formulas 
by utilizing previously found formulas for derivatives, and add to it as you go 
along. In this way you will make a small integral table of your own. 


3. [@ — 07)? 4., x= asin t. 


4. [τ-ὐ + Or det = ἀκ ee 
x dx 2 2 
ὅ. faygin + τ΄. 


6. fra Seay? Gey Spee ay, 


dx 
‘“jJl—z 


wy 


ia 1 eee, 


8. [ane de,t = COs 2. 


9. [cote da, t = sing. 


10. { de sf = 1-Ἐ Ζ. 


1—ez 


*11. Justify (20) by differentiating the right member. 
dx 1 at 2 
(20) faster am ᾿ 


α-- ὦ 

* 12. Show how to find (20) by using the fact that (a? — «221 = (1/2a) 
((a-+ 2) 1 + (ἃ — α), ἢ. 

*13. Find a formula for f(z? — a”) dx. 

* 14. Justify (21). 


(21) [see z dx = In [566 x + tan αἱ. 


*15. Find a formula for f ese x dz. 
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* 16. Show that 
(22) Ὁ Arctan = (1+ 122.-|, 


* 17. Justify (23). Show how it might be found by using (22). | 


(23) [ -τς = (1/2) Arctan (x/a). 


18. Justify (24). 
(24) [@ 4 a)? de = In le + V2? + aI. 


19. Show how (24) might be found by the substitution ὁ = a tan z and the 
use of (21). 
20. Explain and justify (25) 


(25) [ udv = εὖ — [ υ du (Integration by parts). 


21. Use (24)(u:2, v:sin x) to find fx cos z dz. 

22. Find f In x dz by applying (25)(u:In g, v:2). 

23. Find fae" dz. 

24. Argue for (26) from the definition of the two members as limits of sums. 


αἴ 
(26) [faa = fi, Hood at 


25. Argue for (26) by using the fundamental theorem of integral calculus. 
26. Is the right member of (26) what we would obtain by the substitution 
(x:g(t)) in the left member? 


ANSWERS TO EXERCISES 


(a) Otherwise the integrand is imaginary or undefined. (Ὁ) Ὁ Arcsin = 
(1 — J?)-1/2, (4) Since y = Aresinz, y © (--πῶ 2/2) and cosy > 0. 
(e) Yes. No trouble arises if we use identities in terms of the functions them- 
selves, but if we write formulas in terms of images, the variable needs to be 
indicated explicitly. (Ὁ (4-12-22). (g) Do it! (h) If we had done so in 
both (11) and (15) we should have found the same final result. (4) No. Our 
check shows the contrary. (j) (1/3) Arcsin 85. (k) (1/4) (Aresin 25 ++ 
24V 1 — 427). 


ANSWERS TO PROBLEMS 


1. Yes, it is correct. No, he wanted an expression for this antiderivative in 
terms of familiar functions. Possibly “Express the following in terms of poly- 
nomials, trigonometric functions, exponentials, or logarithms, without any in- 
dicated integrations and in as simple a form as possible.” 2. If DF = f, then 
DcF = cf, and conversely, and similarly for (19). 3. Arcsin (χα). 

5. (1/2) In (a? + 12). 7. —In|1 — a]. 9. In |sin αἱ. 
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FURTHER READING 


This chapter was designed to give the reader an appreciation of the basic ideas 
of calculus and a minimum skill in its use. However, we could only touch the 
edges of the vast amount of theory and technique that has developed in this 
area during the last several hundred years. The student who continues mathe- 
matics will undoubtedly take a course that concentrates on calculus. Those who 
are unable to do so may find the following references helpful. 

Introductory Calculus with Analytic Geometry, by Edward G. Begle, Henry 
Holt and Co., 1954. In spirit and notation very similar to this book. 

Calculus and Analytic Geometry, by George B. Thomas, Addison-Wesley, 
1951. Contains a large and interesting collection of problems. 

Calculus, by George E. F. Sherwood and Angus E. Taylor, Prentice-Hall, 1954. 

Differential and Integral Calculus, by R. Courant, (two volumes), Interscience 
Publishers, 19387. The great classic textbook on elementary calculus. Difficult 
but worth the effort. 


*CHAPTER 8* 
PROBABILITY 


8-1 The nature of probability. The word “probability” is used with 
many different meanings and connotations. For example, in “He is prob- 
ably happy” it suggests a degree of belief or confidence. On the other 
hand, in “The probability of throwing a seven with two dice is one-sixth” 
it refers to an expectation that if we throw two dice many times, the total 
of seven will occur about one-sixth of the time. 

The mathematical theory of probability has been developed to deal 
with problems of the type suggested by the second example above. The 
typical problem dealt with by the theory takes the following form: We 
know (or assume) the probability of certain everts; that is, we know how 
often these events occur. We wish to calculate the probabilities of certain 
related events in order to be able to predict how often they will occur. 
Such problems arise, of course, only when we are dealing with events 
_ whose occurrence we cannot predict with absolute certainty. 

Because of the various connotations of “probability,” there exist dif- 
ferences of opinion among philosophers and mathematicians as to the 
meaning of the mathematical theory of probability. However, there is 
virtually unanimous agreement about the fundamental laws and methods 
of calculation. It is the purpose of this chapter (1) to familiarize the 
reader with the fundamental ideas of probability, (2) to show how the 
theory of probability may be derived from a few simple axioms, and (3) 
to indicate how the theory of probability may be applied to a wide variety 
of situations. Difficult proofs are omitted. 

Let us imagine a situation in which any one of a set of events may 
occur. For example, if two dice are thrown, the set of events consists of 
all the different ways that they may come to rest. Letting a = the num- 
ber that comes up on the first die and b = the number that comes up on 
the second die, an outcome is an ordered pair (a, b). The 36 possible events 
are sketched in Fig. 8-1. Long experience by countless experimenters 
indicates that with “fair” dice each of these 36 outcomes occurs about 
equally often. Accordingly, it seems reasonable to say that the prob- 
ability of any one of them is 1/36, i.e., each may be expected to occur 
about 1/36 of the time. 


* Chapter 8 is prerequisite for Chapter 9 but not for Chapter 10. 
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Figure 8-1 


As a rule we are interested not so much in the probability of just a single 
one of the possible outcomes as we are in the probability that any one of 
a set of outcomes occurs. For example, we may be interested in the prob- 
ability that the sum of the faces of the dice is 7. In the figure we note that 
this may happen in 6 ways, 1.e., the subset defined by a + ὃ = 7 has 6 
members. Since each has a probability of 1/36, it seems reasonable that 
the probability that one of them occurs is 6/36 or 1/6. This seems plausi- 
ble because if each occurs about 1/36 of the time, we may expect to observe 
one or another of them about 6 times as often. 


Find the probability that the total of the dice is: (a) 2, (Ὁ) 3, (ce) 4, 
(d) 5, (e) 6 (ὃ 7, (g) 8 (8) 9, G10, GQ) HU, (k) 12, (ἢ) 19. 


Along the lines suggested by the previous example, we always think of 
a probability situation in terms of sets. We visualize a situation (experi- 
ment, observation) as a set of conceivable outcomes. We refer to this set 
of conceivable outcomes as a sample space and to the outcomes as poznis. 
In the experiment of throwing dice the sample space consists of 36 points. 
We refer to each set of points of the sample space as an event. An event 
may be a singleton consisting of just one point, or it may consist of more 
than one point. For example, the event that the sum of the dice is 2 has 
just one member, (1, 1), whereas the event that the sum is 3 has two 
members, (1, 2) and (2, 1). 

Since we think of probability situations in terms of a sample space, and 
of each event as a subset of this sample space, we may apply all the ideas 
of set theory. Letting our sample space be the universe of discourse U, 
the null set @ represents an impossible event (one that contains no mem- 
bers of the sample space); the complement A’ represents the event that 
consists of A not occurring; A ὦ Β is the event that consists of either A 
or B or both occurring; and A Ω B is the event that consists of both A 
and B occurring. 
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Referring again to the dice and Fig. 8-1, let A = {(a,b)|a - ὃ = 7} = 
the occurrence of a total of 7, and B = {(a,b)|a + ὃ = 11} = the oc- 
currence of a total of 11. Then A’ = the nonoccurrence of 7, Α ι! B= 
the occurrence of either 7 or 11, and A ἢ B = the occurrence of both 7 
and 11. Evidently in this case A ὦ B = @, which corresponds τ the 
fact that 7 and 11 cannot both turn up at the same time. 

Usually probability problems (in real life as much as in books) are not 
stated in terms of sample space and its subsets. For example, what is the 
probability that two dice come up with a total of 7 or 11? Letting P[X] 
stand for the probability of X, this may be translated into set terms as 
a request for P[A U B]. By counting the points in the figure, we con- 
clude that P[A U B] = 8/36 = 2/9. It seems reasonable to write 
ΡΙΑ n Β] = 0. Similarly, the probability that one die or the other 
shows 6 is P{(z, y)|z& = 6 V y = 6} = 11/36, as the reader can find 
by counting the points in the indicated set. Note that in this case one can- 
not simply count the points where x = 6 and the points where y = 6 
and then add, since these two sets overlap. 

If an event (subset of the sample space) is defined by a simple sentence, 
it is customary to use the sentence in place of the full name of the set. 
Thus we have P[the sum of the dice is 7] = Pix + y = 7] = P[A] = 


Find the following by reference to Fig. 8-1: (m) The probability that the 
sum of the dice is 8. (Ὁ) The probability that the sum is either 2 or 3. 
(0) Pile-+y = 4. (p) Ple —y = 2... (q) Ple =2V a = 4. 

(r) Piz =2V y= 3). (s) Plr ¥ νυν] (t) Pile = ψ. (u) The prob- 
ability that one of the dice shows twice the other. (v) The probability that 
both dice show an even number. (w) The probability that both show an odd 
number. (x) The probability that neither shows anevennumber. (y) P[2 < 
ety S12). (ὦ Plet+y « 2]. 


ANSWERS TO EXERCISES 


(a) 1/36. (b) 1/18. (ὁ) 1/12. (d) 1/9. (6) 5/36. (ὃ 176. (g) 5/36. 
(h) 1/9. (i) 1/12. (j) 1/18. (k) 1/36. (Ὁ 0. (m) Pilx+y = 8] = 5/36. 
(n) Pla+y =2V 2e¢+y = 3) = 1/12. (0) 1/12. (p) 1/9. (q) 1/8. 
(r) 11/386. (8) 5/6. (t) 1/6. (u) 1/6. (v) 1/4. (w) 1/4. (x) 3/4. (y) 1. 
(2) 0. 


8-2 Fundamental laws of probability. To construct an axiomatic 
theory of probability we imagine a universe of discourse U which we refer 
to as sample space and view as the set of all conceivable outcomes of an 
experiment. Each of the subsets of U is called an event. We wish to as- 
sociate with each event X a probability P[X] so that the following axioms 
are satisfied. 
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(1) P(X] € Re, 

(2) P[X] = 0, 

(3) (AN 8Β -- 6) -» (PIA U B] = P[A] + PIB), 
(4) P[U] = 1. 


These axioms assert that the probability of any event is a non-negative 
(2) real number (1), that the probability of either of two disjoint events 
is found by adding their probabilities (3), and that the probability of the 
sample space is 1 (4). 

The reasonableness of (3) is apparent from the examples relating to 
two dice considered in Section 8-1. Letting P(A) = P(x = 6) and 
P(B) = Pty = 6), we have P(A) = P(B) = 1/6, but P(A UB) = 
Pia =6V y= 6) = 11/86 » 176 - 1/6. Here A is not disjoint 
with B, and we see that the right member of (3) fails. However, letting 
C= (yi e= δ); ῬΙ6] = 1/6and δι ὃ CC) =—]FPe=6 Ve=—] 5) — 
1/3 = P[A] + P[C). Here A//C and the right member of (3) holds 

with (B:C). 

The probability associated with a set appears to be a kind of measure 
of the set. Accordingly, we would expect that the properties (7—10-1) 
through (7-10-4) would hold. We already have (7-10-2) and (7-10-3) 
included in the above axioms. We shall derive (7-10—-1) and (7-10-4) 
below. | 

To derive 


(5) ΡΙΟῚ = 0 


we note that 1 = P[U] = Ρίῦ ὦ 6] = Ρ[Ὁ] + P/M] = 14+ POM) 
since U and @ are disjoint. From 1 = 1 + P[@], we have (δ) immedi- 
ately. We interpret @ as the impossible event, the event that consists in 
the occurrence of no one of the set of all possible outcomes of the experi- 
ment. Hence (5) says that an impossible event has probability zero, that 
is, A = ὁ — P(A) = 0. The converse of this, however, cannot be proved 
from (1) through (4). Actually there are situations in which an event 
that is possible has probability zero. I*or example, the probability that 
a point chosen at random on an axis 15 at a rational point 1s zero. The 
proof of this involves considerably more mathematics than is contained 
in this book, but it may be made plausible by considering how the 
irrationals are much more tightly packed on the line than the rationals. 

Since A and its complement A’ are disjoint, we have P[A] + P[A’] = 
PLA ὦ A’] = P[U] = 1. Hence 


(6) Pi A’) = 1 PA). 
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Since A’ occurs if and only if A does not, (6) says that the probability 
that an event does not happen is 1 minus the probability that 1t does 
happen. For example, referring to the two dice in Section 8-1, 


PB<aety<MW=—-1—Prty=2V2r+y= 12]. 


The calculations suggested by the right member are easier than those sug- 
gested by the left. 


(a) Find P[3 < x+ y < 11] by adding the seven probabilities. (b) Get 
the same result by using (6). 

Use (6) to find the probability that: (c) 5 does not come up when one die 
is thrown, (d) 5 is not the total when two dice are thrown, (6) the sum of 
the faces is greater than 3 when two dice are thrown. 


We now derive 


(7) ACB— P(A] < PIB], 


which is just (7-10-4). To do so we note that ACB oB= AU 
(A’ ἡ B), where A’ B and A are disjoint. Hence P[B] = P[A] + 
P[{A’ nN B] => PIA], since P[A’ ἢ 8] = 0. 


(Ὁ Justify each of the statements in the proof of (7) (6) Prove (8) by using 
(4) and (3-8-4). 


(8) ΡΙΑ] < 1. 
Axiom (3) covers only the case of disjoint sets. A more general law is 
(9) P[A U B] = P[A] + P[B] — P[A ἡ BI. 


To prove it we note that A U B = (A Ω B’) UB, where AN B’ and B 
are disjoint. Hence P(A ὦ Β) = P(A NB’) + P(B). Now we note 
that (A N B) U (AN Β΄) = A, where (A ἢ B) and (A ἢ B’) are dis- 
joint. Hence P[A ἢ B] + P[A ἢ B’] = P[A]. Combining these equa- 
tions we arrive at (9). This law will be very useful as soon as we have a 
procedure for finding P[A m B]. The necessary theory is included in Sec- 
tion 8-5. 


(h) Justify the set identities and do the final manipulation. 


We may summarize the laws of this section as follows. (1), (2), (8): 
The probability of an event is a real number in the closed interval from 
zero to one. (5), (4): An impossible event has zero probability and a cer- 
tain event has probability of one. (3): Probabilities of mutually exclusive 
events are additive. (7): The probability of an event cannot be greater 
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than that of an event that always occurs whenever the first one does. 
(9): The probability that either one of two events occurs is the sum of 
the probabilities that either one occurs less the probability that both 
occur. : 

We have already pointed out that there is nothing in the axioms to pre- 
vent a possible event (a non-null set) from having zero probability. Simi- 
larly there is nothing to prevent a noncertain event (one not equal to the 
universe of discourse) from having a probability of one. We may write 
(4) as A = U — P[A] = 1. The converse of this is not true. We shall 
give examples in later sections. 


PROBLEMS 


1. Show that if A//B, B//C, and A//C, then P[A U BUC] = P(A) + 
ΡΙΒῚΊ + P[C}. 

2. By induction prove a similar theorem for any number of mutually disjoint 
sets. 

3. Prove that 4 ἢ BOC = 01s not a sufficient condition for P[AU BUC] = 
ΡΙΑῚ + P[B} + PIC. 

4, State and prove a theorem that generalizes (9). 

5. Prove (10). 


(10) P[A UB] < PLA] + PIB]. 


6. Illustrate the laws of this section by reference to the example of Section 
8-1. | 


Problems 7 through 12 refer to the situation of two dice described in 8-1. 


7. What is the probability that the two dice come up the same? 
8. What is the probability that one die is even and the other odd? 
9. That both dice are even? 

10. That one die is even or that the dice come up the same? 

11. That the sum of the faces is 5 and that one face is even? 

12. That the sum of the faces is 5 or that one face is even? 


13. Under what conditions does the equality hold in the right member of (7)? 

14. Generalize (10). 

15. Statistics indicate that there are 13 persons injured by tornados for every 
one killed by them. The figures for Arkansas show that a person living 68 years 
there has one chance in 1,490 of being killed, one in 115 of being injured. What 
is the chance of being involved through injury or death in a tornado in Arkansas? 


Prove and interpret 16 through 19 verbally. 


16. P(A NM B] < P[A]. 

17. P[AU A] = 1. 

18. P[AN A’ = 0. | 

19. P[AN B] = P[A] — P[AN B’]. 


*% 20. How many members are there in the domain of P? 
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ANSWERS TO EXERCISES 


(a) 17/18. (c) 5/6. (4) 8/9. (e) 11/12. (f) AU (A’NB) = (AUA)N 
(AUB) = UN(AUB) = AUB =B by (8-8-12) when ACB. The 
disjointness follows from AM (A'’N B) = (AN A)NB=ONB = ὁ. 
(g) AC Uand P(U) = 1. Then (7)(B:U). (h) Use laws of Section 3-7 as 
above. Compare with the proof of (6-3-3). 


ANSWERS TO PROBLEMS 


1. P[AU(BUC)] = P[A]+ P[B UC] since A//B and A//C imply 
A//(B UC). 3. For two dice let A, B, and C be respectively the events that 
the first die shows 1, the sum of the dice is 7, and the points on the two dice are 
the same. Then the left member is 4/9 and the right member is 1/2. 5. From 
(9), since P[A ἢ 8] > 0. 7. 1/6. 9. 1/4. 11. 1/9. 18. When P[A’ ἢ B] = 0. 
(It is not necessary that A = B.) 15. About 0.0093. 17. (38-7-19). 
19. (3-7-24), the terms in whose right member are disjoint. 


8-3 Relative frequency. In the previous section we assumed that P 
is a function whose domain is some collection of subsets of some universe 
and which satisfies the stated axioms. We did not further define P, and 
indeed we are free to specify P in any way we wish as long as the axioms 
are satisfied. 

In particular, if the sample space is a finite set, it suffices to assign a 
non-negative probability to each point in such a way that the sum of all 
probabilities is 1, and then let the probability of any event be the sum of 
the probabilities associated with its points. For example, suppose that a 
die is loaded so that the probabilities of points from 1 to 6 are 1/6, 7/36, 
1/9, 5/36, 2/9, 1/6, respectively. Then the probability of getting an even 
point is 7/36 + 5/36 + 1/6. 

In this situation what is the probability that the point is: (a) less than 3? 
(b) odd? (c) even or divisible by 3? (d) greater than 1? 

(6) Let the sample space consist of πὶ points a1,..., ας with which are asso- 
ciated probabilities pi,..., Pn, such that p; > Ὁ and dip; = 1. Let P[X] = 
>_p;, where the summation is carried over all 7 for which a; Ε X. Show that 
the axioms are satisfied. 


In a very large number of situations the sample space is chosen so that 
the probabilities associated with the sample points are equal, i.e., the 
sample points are equally likely. Letting 9t(X) be the number.of members 
of X, which we call the frequency of X (see Section 6-2), we see that the 
probability associated with each point is 1/9t(U), i.e., the reciprocal of 
the number of points in the sample space. Then the schema of Exercise 
(e) suggests the following definition of probability. 


(1) P[x] =a W(X) /N(U). 
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With this definition, the probability of an event is simply the number of 
favorable cases, 9((X), divided by the total number of cases, 91(U). It 
is called the relative frequency of the event. 


(f) How do we know immediately from Exercise (e) that (1) yields a prob- 
ability satisfying the axioms? (g) Without relying on Exercise (6), show that 
(1) satisfies the axioms. (h) Why is it not necessary to prove (8-2-5) through 
(8-2-10) directly from (1)? (i) Nevertheless, prove (8~-2—5) through (8—2-10) 
directly from (1). 

(1) According to the American Experience Table of Mortality, of 90,471 
people living at age 23, 45,291 live to age 67. State this in probability terms. 
What is the sample space? What is the set corresponding to the probability? 
What does this result mean for a particular individual age 23? (k) In 1952, 
46.8% of the auto accidents resulting in deaths and injuries involved exceeding 
the speed limit. State this in probability terms after indicating the universe of 
discourse. 


Most probability statements that are made in daily affairs and in 
science can be reformulated precisely in relative frequency terms, but some 
care may be required in order to choose the sample space properly. In 
some cases probability statements are merely ways of reporting an actual 
relative frequency that has been observed. Exercise (k) is an example of 
this. In other cases the probability is thought to indicate the relative 
frequency with which the event will occur “in the long run.” Thus Exer- 
cise (j) reports an observed relative frequency, but it also suggests the 
proportion of persons 23 years old who may be expected to live to 68. 


(1) “In Iowa...there is one chance in 1,203 that any given square mile will 
be struck by a tornado in any one year.” What does this mean? 


To calculate probabilities from (1) we need to be able to find the fre- 
quencies involved. Usually the problem takes the form: how many ways 
can such and such an event take place? For example, to calculate the 
probability of getting a one-suit hand in the game of bridge we need to 
know the total number of different possible hands and the number of these 
in which one gets 13 cards of one suit. The branch of mathematics that 
deals with such questions is called combinatorial analysis. Its fundamental 
law is: 


If a certain event can take place in ny ways and then another 
(2) in ng ways, then the ordered pair of events can occur in Ἠ 172 
ways. | 


For example, one die can come up in 6 ways, a second in 6 ways, and the 
two together in 36 ways. This law is really only a different way of stating 
(6-3-6), since we can think of each different way of doing the two things 
as an ordered pair of a way of doing the first and a way of doing the second. 
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(m) In how many ways can two houses be painted if the first house can be 
painted three ways and the second four? (n) In how many ways can three 
names be assigned to three houses, so that no two have the same name? (0) In 
how many ways can three dice come up? 


In statistics a set of objects under study is often called a population. 
An ordered set (a1, @2,..., ἃ.) whose components are drawn from a pop- 
ulation is called a sample. (Often the sample space is a set of samples 
drawn from a population, and this is the origin of the name.) We may 
visualize a sample formed by choosing members of the population one 
after the other. If the method of choice is such that all possible samples 
are equally likely, we call the result a random sample. 

One method of drawing a random sample consists of choosing an object 
(so that each object has equal probability) and then replacing the object 
before making the next choice. This is called sampling with replacement. 
Suppose the population has nm members and we wish to draw a sample of r 
members. Then the first element in the sample can be chosen in n ways. 
So can the second and all succeeding elements, since the original situation 
is restored after each choice. Hence, from (2), 


(3) In sampling with replacement, the number of different samples 
of r members chosen from a population of n members 18 n’. 


(p) How is (2) used in (3)? 


For example, how many different sequences of 10 symbols can be 
formed with the plus and minus signs? Here n = 2, since the population 
is {+, —}, and r = 10. Hence the number of sequences is 2'°. Of these 
only one consists of all +’s and only one of all —’s. Hence the probability 
that all symbols are the same is 2,219 or 1.29, 


(q) How many different sequences of four symbols can be formed from the 
digits, where a digit may be used more than once? 


A second method of forming a sample is to choose a first element, then 
choose the second element from the remaining elements, and so on. This 
is called sampling without replacement. Since the first element can be chosen 
in n ways, the second in n — 1, and so on, it is easy to see that the number 
of samples without replacement is n(n — 1)(n — 2)... (ιυ — r+ 1) or 
n\/(n -- r)!. This number is often called the number of permutations of 
n things taken r at a time. It is the number of different ways we can choose 
and arrange r things from among n things. A common symbol for it is 
Py. 


(4) P? = ni/(n — yl. 
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For example, in how many ways can we choose a president, vice-presi- 
dent, and secretary from among ten people? Here r = 3, n = 10 and 
Pl? = 10-9-8 = 720. When n = r we have the number of arrange- 
ments of n things, that is, P;, = n! For example, a squad of eight soldiers 
may be lined up in 8! ways. 


(r) In how many ways can a sergeant and a corporal be chosen from a squad 
of 12 men? (8) How many ways can the squad be arranged in a line? 


To calculate probabilities from (1) we find the frequencies of the event 
and of the universe and divide. For example, to find the probability that 
eight soldiers of different heights will line up in order of their heights from 
right to left if they are arranged at random, we note that there is just one 
way to arrange them correctly, i.e., the frequency of our event is 1. The 
frequency of the universe is 8!. Hence the required probability is 1/8!. 


(t) What is the probability that in six throws of a die the numbers 1 through 
6 turn up in that order? (ἃ) If a student is to match three items on a test 
with three items from a list of five items, what is the probability that he will 
succeed by chance alone? 


PROBLEMS 


1. Extend (2) to s events and prove your result. 

2. The A auto company offers 16 body styles, three different engines, 50 
items of optional equipment, and 286 exterior and interior color variations. The 
B company offers 16 basic body styles, 22 optional items of equipment, and 20 
colors offered in up to three possible combinations on each auto. For each such 
color or combination, there are two interior colors. (Data from Wall Street 
Journal, March 21, 1955) Which company makes the greatest number of dif- 
ferent kinds of cars? 

3. A psychologist wishes to present the subjects of an experiment with three 
different stimuli simultaneously. If the first stimulus has three possible values, 
the second 15, and the third 50, how many different combinations can he present? 

4. In how many ways can a football team of 11 be assigned to the positions? 

5. In how many ways could a first-string football team be organized from 
among 12 men? 13 men? 22 men? 

6. In how many ways can the ten digits be arranged in a line? How many of 
these give integers written in the usual way? 

7. Studies of chess play have shown that in typical positions there are about 
30 legal moves. Hence for two moves there are about 10° possibilities. Why? 

8. If a typical chess game lasts about 40 moves, about how many different 
games are possible? 

9. Suppose that an electronic brain were to try out all possible chess games in 
order to see the final result of each, and suppose that it can go through a million 
different games in one-millionth of a second. How long would it take the machine 
to try out all games? 

10. Suppose a rat is run through a T-maze in which it has the choice of turning 
right (2) or left (1), and suppose that & brings reward and LZ a punishment. 
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Suppose that we consider the rat to have “learned” if it makes the correct choice 
five times in succession. We let the rat run 20 times to see whether he is able to 
learn in that number of trials. Let U be the set of all different ways in which the 
rat can run these 20 trials, i.e., all different sequences of 20 R’s and L’s. What 
is 1(U)? Let X be the event of the rat’s having learned. Describe X as a sub- 
set of U. (Stochastic Models for Learning, by R. R. Bush and F. Mosteller) 

11. Suppose that in a batch of 100 light bulbs there is one defective bulb. 
What is the probability of picking it at random? 

12. Suppose in this situation we pick a random sample without replacement 
of five light bulbs. How many such samples might we pick? How many of them 
contain the defective bulb? What is the probability that a random sample of 
five bulbs contains the defective one? 

13. Suppose that there are 20 male rats and 20 female rats, some of which 

are cancerous. The 40 rats are divided into four disjoint subsets, the male can- 
cerous, female cancerous, male noncancerous, and female noncancerous. The 
number of members in these subsets is known as soon as we know an ordered 
pair of numbers (z, y) giving the numbers of male and female cancerous rats. 
We consider a sample space consisting of all the points (7, y). How many mem- 
bers does it have? 
%14. For examples in which the points in a sample space do not have equal 
probabilities, see the article by L. Guttman in Psychometrika, Vol. 11, 1946, 
pp. 81-95, and the article by A. P. Horst in the Journal of Educational Psy- 
chology, Vol. 24, 1933, pp. 229-232. 

15. Suppose four objects are ranked by an observer, Le., given ranking num- 
bers from 1 through 4. We are interested in a particular pair of objects and we 
record the ranks given to them as an ordered pair of numbers. For example, 
(2,3) would indicate that the objects were assigned the orders 2 and 3 respec- 
tively. How many ways can these two objects be assigned ranks? Assuming 
that each assignment is equally likely, what is the probability that the sum of 
the ranks is 4? 

16. Solve Problem 15 if there are six objects in which we are interested in a 
subset of three objects. 

17. Prove (4) by induction. 

18. Probabilities are often stated in terms of odds or chances, according to 
the following definitions. 


(5) The odds arer tosfor X = [P[X] = r/(r4+ 8)] 
(6) The odds are 8 tor against X = [P[X] = r/(r+ 8)] 
(7) The chances are r to 8 that X = [P[X] = r/s] 


Suppose that the probability of a coin coming up “heads” is 0.51. State this in 
terms of odds and chances. 

19. When two pairs of identical twins with the same first names appeared 
in the same graduating class, a professor said that the chances were 1,140,000 
to 1 against the coincidence. What did he mean? 

20. What probability corresponds to odds of two-to-one in favor? 
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21. According to an article in the bulletin of the Commonwealth Club of 
San Francisco of January 21, 1946, “When a section reaches a conclusion after 
long and careful study, chances are 13 to 1 that the club, if it takes a ballot, will | 
vote the same way.” What does this mean? 

*% 22. Referring to Exercise (u), what is the probability that the student will 
match just two out of three? 

* 23. Just one out of three? 

% 24. None at all? 


ANSWERS TO EXERCISES 


(a) 18/36. (0) 1/2. (ὦ 11/18. (d) 5/6. (ὁ) Immediate by con- 
sidering each axiom. (f) This is a special case of (6) with p; = 1/9U(U) for all 
2. (6) Consider each axiom. (38) follows from (6-3-2). (Ὁ) Since the others 
follow from the axioms, and we have shown the axioms apply. (4) Use (6-3-3) 
and other properties of cardinals of sets. (j) The probability that a person 
alive at 23 will live to be 67 or older is 45,291/90,471, or about 1/2. Sample 
space is a set of people alive at the age of 23, probably a set of insured Americans. 
Nothing; a particular person may be more or less likely to live that long, de- 
pending on health, ete. The probability does not refer to an individual but to 
our choice of individuals. If we choose individuals at random, about half of 
them will live to 67 or more. 

(k) Sample space (universe of discourse) is set of auto accidents resulting in 
deaths or injuries in 1952. Probability of the subset consisting of those in which 
speed limit was exceeded is 0.468. (1) About 1/1203 of the square-mile regions 
in Iowa are struck in any one year. Also, we would expect a given square mile 
to be struck about once in every 1,203 years. (m) 12. (0) 6. (0) 6% 
(p) Repeatedly, r times. (q) 104. (r) 132. (s) 12 (Ὁ 1/68 (u) 1/853. 


ANSWERS TO PROBLEMS 


1. Use induction. 3. 2,250. 5. 12!, 13!/2, 22!/11!. 7. 802 = 10%. 9. 104° 
years approximately. 11. 1/100. 18. 400. 15. 12; 1/4. 19. Probability is 
1/1,140,000,001, or approximately 1/10!9. 21. The rest of the context indicates 
that this summarizes the fact that during the past 10 years the committees and 
the club membership differed on about 1/14 of the questions on which both voted. 
However, in addition to reporting this observed relative frequency, there is the 
suggestion that one may expect it to continue, i.e., from the experience of the 
past 10 years one may estimate a relative frequency that will be likely to appear 
in the future. If we think of the past experience, the sample space is the set of 
all questions on which both club and a committee voted. If we think of the 
future, the sample space is an imagined set of future questions. 


8-4 Partitions. We have seen in the previous section that knowing 
how to find the number of samples (with or without replacement) of a 
given size that can be chosen from a given set is often useful in finding 
relative frequencies. Of course, not all problems in combinatory analysis 
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can be treated in this way, as is evident from the problems in Section 8-3. 
Indeed, one cannot expect to find formulas to fit every situation, and the 
student should be prepared to attack problems directly in terms of the 
fundamental law (8-3-2). However, many combinatorial problems can 
be handled in terms of the concept of a partition. 

Often we wish to break up a set into subsets in such a way that every 
object belongs to one and only one of the subsets, i.e., so that the union 
of the subsets is the whole set and every pair of the subsets is disjoint. 
More precisely, a class of sets {A1, A2,..., An} such that A;//A; for 
alli = jand A, U AgU...U A, = A is said to be a partition of A. 


(a) The process of assigning members of a set to subsets is often called classt- 
fication. When all members of the set are classified, the classification is called 
exhaustive. When no member is assigned to more than one subset, the classifica- 
tion is said to be mutually exclusive. Give an example of an exhaustive and 
mutually exclusive classification. (Ὁ) Give an example of a classification that 
is not a partition. (c) Did we use partitions in Chapter 7? 


A partition whose sets are ordered is called an ordered partition. Note 
that the sets are ordered, but not the members of the sets. Thus an or- 
dered partition is simply an ordered set (4, A2g,..., An) of subsets of 
some set where {A,, Ag,..., An} is a partition of A. For example 
{{1, 2}, {8,4}} is a partition of {1,2, 8,4} and {{3,4}, {1,2}} is the 
same partition. But ({1, 2}, {3,4}) and ({3, 4}, {1,2}) are different 
ordered partitions of {1, 2, 3, 4}. 

The simplest partition of a set A is {A,, Aj}, where the complement 
is taken with respect to A. We simply divided A into a set A, and all 
other elements of A. The corresponding classification is called a dichot- 
omy. Numerous problems in probability are expressible in terms of ordered 
dichotomies. For example, what is the probability that a person playing 
bridge gets all cards of one suit? Here our sample space is the set of all 
different 13-card hands that can be picked from a 52-card deck. How many 
members does this space have? Now choosing a 13-card hand amounts to 
deciding on an ordered dichotomy of the 52 cards into two subsets of 13 
and 39 cards respectively. Accordingly, to solve the problem we need to 
know how many different ways we can perform such a dichotomy. 

More generally, we are interested in knowing how many different or- 
dered dichotomies there are in which a set of n members is partitioned into 
a set of r members and a set of πὶ — r members. Often a set of r objects 
chosen (without regard to order) from a set of n objects is called a com- 
bination of n things taken r at a time. The symbol for the number of 
combinations 15 ΟἿ. 

To find a formula we make use of the fact that we know how to find. 
the number of permutations of z things taken r at a time, i.e., the number 
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of sets of r objects (ordered) that can be chosen from n objects. This is 
given by P? = n!/(n — r)!. Now in order to form a permutation we can 
first form a combination and then arrange this combination in some order. 
The first can be done in C? ways, the second in r! ways by (8-3-4). Hence 
Po = TIC and 


(1) Cr = ni/rin — ἢ}. 


Returning to ue problem of bridge hands, the number of points in the 
sample space is C?%. Since there is just one point in the set whose prob- 
ability we seek, te probability is 1/C?2 or 13!39!/52!, if we assume all 
points equally Likely. 


*(d) Estimate this to order of magnitude. (6) Show that 
(2) C= δε, 


(Ὁ What is the probability of getting any particular hand in bridge specified 
in advance? (g) What is the probability of getting four aces in a five-card 
hand dealt from a 52-card pack? (h) Calculate C2 and verify your result by 
writing down the two member subsets of {a, ὦ, c, αἰ, e}. 


The results for partitioning into two subsets can be generalized to par- 
titioning into k subsets. Indeed, 


The number of different ordered partitions of a set of n mem- 
(3) bers into k subsets with r1, ro, ..., Tr members is 


n'/rylralrs! cee rel. 


We prove (38) by induction. For k = 2, re = ἢ — ry; and-(3) reduces to 
(1). Hence (8) holds for k = 2. Suppose (3) holds for k subsets. Then a 
partitioning into k + 1 subsets can be obtained by first partitioning the 
n objects into k subsets with ry, ro, ... , κα, Tk + Tk41 Members, and then 
partitioning the last set into two sets of 7, and r,4, members. The first 
operation can be performed in n!/ry!ra!... (re + Tha)! ways, by the 
induction assumption, the second in (re -- Pineal ways. Multi- 
plying we get (3)(k:k + 1). 

(i) Why doesri + re+...+ 7% = n? 

To illustrate the use of (3), consider the number of different ways that 
four hands can be dealt in bridge, that is, the number of different ordered 
partitions of 52 cards into four subsets of 13 cards each. Note that it is 
an ordered partition that applies, since the same hands redistributed 
among North, South, East, and West would constitute a different situa- 
tion. According to (3), we have 52!/(13!)*. 
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(j) How many different ways can five-card hands be dealt to six poker players? 
(Note that we must partition the 52 cards into seven subsets, including the set 
left over after the deal.) 


We adopt the notation ΟἿ, +,,...7, for the formula in (3). 


Calculate: (k) C311; (1) C301; (m) C3 1: 

(n) What is the probability that one of six players is dealt four aces in a 
five-card hand? 

An experiment with two possible outcomes a@ and a’ is performed 10 times in 
order. (0) In how many different ways can it happen that a occurs three 
times and a’ seven times? (p) a five times and a’ five times? (q) a seven 
times and a’ three times? (r) a ten times? (8) Generalize to an experiment 
performed n times and give the number of ways in which a could occur x times 
and a’ n — x times. (t) How many different ways can x 0’s and n — zx 1.8 
be arranged in a straight line? 


α Stirling’s formula. As the reader has seen, the evaluation of prob- 
abilities often involved calculating n! for large n. This is facilitated by the 
following approximation, known as Stirling’s formula. 


(4) nl = (2π)} 32. the, 


Here 6 is the base of the system of natural logarithms. The formula 15 a 
good approximation in the sense that 


(5) lim (2π}}} 2 tt e—"/n! = 1. 


nwo 


Actually the approximation is always too small, but the percent error 
approaches zero, and for n = 10 it is already less than 1%. 


*(u) Compare the two members of (4) forn = 1, 2,5, 10, and 100. *(v) 
Use (4) to evaluate some of the frequencies and probabilities in problems of 
this section. 


PROBLEMS 


1. What is the relation between ΟἹ and (%)? (See Section 7-7.) 

2. Show that the number of combinations of n+ 1 things 7 at a time is 
equal to the sum of the number of combinations of n things 7 at a time and the 
number of combinations of n things ὃ — 1 at a time. 

3. Show that 1 - Οἵ - Οὗ +C3+...+-C% = 2". 

4, Interpret Problem 3 in terms of combinations and sampling. 

5. The reliability of a test is often estimated by comparing the results on 
half the questions with those for the other half. In how many ways can a test of 
2n items be split in half? What conclusions do you draw about this method of 
estimating reliability? 
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6. In how many ways can the five letters a, a, ὃ, ὃ, c be arranged if we count 
as different arrangements only those that appear to be different, i.e., if we do not 
distinguish different occurrences of the same letter? 

7. How many different signals can be made with dots, dashes, and pauses 
by using a sequence of five such elements? 

8. What is the probability of getting a full house (three of one kind and two 
of another) in a five-card poker hand? 

9. If a set is classified by several different dichotomies, (4,, A{), (Ag, A$), 

., (A,, A‘), it is partitioned into sets each one of which is the intersection 
of n sets, one chosen from each dichotomy. For example, one of the sets is 
Ay δῶ 420 AZN.... These sets are called cross classifications. How 
many cross classifications are determined by n dichotomies? 

10. In a bag are three black balls, 61, b2, b3, and two white balls, w1, we. 
How many different pairs can be drawn? (What is the sample space?) 

11. Assuming that cach pair in Problem 10 is equally likely, what is the 
probability of any one pair? What is the probability of drawing two black balls? 
two white balls? a black and a white? 

*12. What is the probability that in a sample of r people all have different 
birthdays? What is the probability that at least two have the same birthday? 
%* 13. Find the answer to Problem 12 for r = 10, 20, and 30. 

14. What is the probability that in six throws of two dice, seven does not 
turn up? 

15. Suppose that each of n sticks is broken into one long and one short part. 
The 2n parts are arranged into n pairs from which new sticks are formed. Find 
the probability that the parts will be rejoined as before (i.e., each original stick 
reformed), and find the probability that all new sticks will be formed from one 
short and one long part. (This problem is related to the genetic effects of ra- 
diation. It is taken from An Introduction to Probability Theory and Its Appli- 
cations, by William Feller. In the first four chapters of that book the reader will 
find numerous other interesting problems illustrating combinatorial analysis.) 

16. Suppose in a certain population there are n; blue elements and ne green 
elements. A random sample of r elements is chosen without replacement. Show 
that the probability that exactly x elements are blue is C”:C?~"1/C?, where 
n= n+ no. 

*17. We have seen in Problem 1 that C” is the coefficient of x” in the expansion 
of (a+ x)". For this reason the C” are called the binomial coefficients. Show 


that ΟἹ 5,5) ++ +> 7, 18 the coefficient of 11"1x2"2 . . . ἀκα in the expansion of (x1 + 
zo+...+2,)". For this reason these numbers are called the multinomial 
coefficients. — 


*18. Prove the binomial theorem by a combinatorial argument. (Suggestion: 
How is each term formed in the expansion of (a@-+ x)” = (a+ a)(a+ 2)... 
(a - x) before collecting terms?) 

* 19. Generalize by stating and proving a multinomial theorem. 

% 20. A device of very general utility for attacking combinatorial problems is 
the use of “trees.” This method is presented in An Introduction to Finite Mathe- 
matics, by J. G. Kemeny, J. L. Snell, and G. L. Thompson. Study this method 
as there presented and apply it to problems of your choice. 
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%21. Many interesting examples in combinatorial analysis may be found in 
Choice and Chance,:-by William Whitworth. 

* 22. Show that C7Cy77,/Ch, = Cnepap/Cr. 

* 23. Show that ))'=5(—1)'C? = 0. 

% 24. What is the probability that when five-card poker hands are dealt to 
three players each one has just one ace? 

% 25. What is the probability that one of three poker players has three aces, 
another three kings, and a third three queens, when five cards have been dealt 
to each? 


ANSWERS TO EXERCISES 


(a) People into males and females. Molecules into substances. Any partition. 
(b) People into adults, males, and females. (6) In definition of definite 
integral we partitioned the interval into subintervals, but permitted the sub- 
intervals to have endpoints in common. (d) 1.1012. (6) Immediate from 
(4). (Ὁ 1/C?2 as above! No hand is any more improbable than any other. 
However, there are more mediocre than good hands, and therefore the prob- 
ability of getting one of the set of “good” hands is less than that of getting one 
of the “bad” ones. (9) 1-48/C2". (h) 10. (i) Definition of partition. 

(j) Ce555.5.5- (k) 12. (ἡ 30. (m) 8. (n) The total number of 
possible ways of dealing is given by Exercise (j). Then to get a deal of the kind 
desired we may first choose the player to have the aces (in five ways), decide on 
the fifth card (in 48 ways), and then distribute the remaining 47 cards to the 


ἀν ee EES See. 


C2, 5.5.5,5.5- (0) Ci, since it is a matter of assigning the 10 outcomes to 
two boxes, the a box and the a’ box. (p) Cl®. (q) CH. () 1. (s) C®. 
(t) C®. 
ANSWERS TO PROBLEMS 
: 9 (2n)! 
1. The same. 3. Consider (1 + 1)". 5. Co” = (nl)? One wonders whether 
n! 
the choice of dichotomy might not make a difference. 7. 3° (assuming that 
signals such as five pauses are allowed). 9. 2”. 11. 1/C2, C3/C3, C2/C3, 3 - 2/C3. 
13. For r = 23, the probability is approximately 1/2. 15. 2"/[(2n)!/n!], 2"/C?". 


8-5 Conditional probability. Suppose that the sum of the faces of two 
dice is seven. What is the probability that the faces differ by five? The 
desired event can happen in two ways, symbolized by (1, 6) and (6, 1). 
The sample space here is not the 36 ways in which two dice may come up, 
but merely the six ways in which a total of seven may occur. Hence the 
desired probability is 1/3. On the other hand, without the indication that 
the sum of the dice is seven, the probability that the faces differ by five. 
is 2/36 or 1/8. We see that the probability of an event depends upon the 
set of possible events we are considering. 

Let P[A|B] be the probability that A occurs given that B occurs, or 
the conditional probability of A on the hypothesis B. Then P[A|U] is 
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the usual probability P[A] calculated with respect to the original sample 
space. Sometimes it is called absolute probability. Suppose that U is a 
finite set of equally likely points. Then P[A |B] is the relative frequency of 
A in B, 1.e., the probability calculated as if B were the sample space. 


(1) — P[A|B] = 9(A ἡ B)/9(B). 


Also P[B] = P[B|\U] = 9(B)/aU(U), and P[A nN 8] = σία ἡ B)/IU(U). 
We note that the following holds for the relative frequencies. 


(2) Def. P{[A|B] = P[ANn BY/PIBI. 


We take it as a definition of P[A | B] in all cases where P[B] ~*~ 0. When 
ΡΙΒ] = 0 we do not define P[A |B]. 

For example, letting U = the cases when two dice are thrown, A = 
the cases where the faces differ by five, and B = the cases where the sum 
of the faces is seven, P{[A ἢ B] = 1/18, P[B] = 1/6, and P{[A|B] = 
(1/18)/(1/6) = 1/3, as before. 


(a) Let A = the faces differ by four, B = the sum of the faces is six. Find 
P{A], P[B], Pl. B], and P[4A}B] by enumeration in Fig. 8-1, and verify 
that (2) holds. (b) Do the same, but with B = the sum of the faces is even. 


I'rom (2) we have immediately 
(3) ΡΙΑ Ὁ B\ = ΡΙΒΊΙΡ[Α [Β] = P{AJP[B| A]. 


The first equation is found by multiplying both members of (2) by P[B], 
the second by (B:A, A:B). This law says that the probability that both 
A and B occur is the probability that B occurs (the absolute probability) 
times the conditional probability of A given B, or the probability of A 
times the conditional probability of B given A. This formula is often use- 
ful because it frequently happens that it is easier to find the probabilities 
in the right member than to find P[A M B] directly by enumeration. 
Sometimes events that consist in the intersections of events (i.e., the 
simultaneous occurrence of events) are called compound events. 

For example, consider the following experiment. We have an urn con- 
taining seven blue and three gold chips, and a second containing five 
blue and five gold. We draw a chip at random from the first urn and place 
-it in the second, then draw a chip at random from the second. What is the 
probability that we draw gold chips both times? Here A = gold chip 
on first draw, B = gold chip on second. We have P[A] = 3/10, P[B| A] = 
6/11, and P[A ἡ B] = 9/55. An enumeration of all possible pairs of draw- 
ings is more tedious, though not difficult in this case. 
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(c) What is the probability of drawing both blue chips? (d) first gold, then 
blue? (ὁ) first blue, then gold? (ὃ) What is the probability of drawing gold 
on the second draw? (6) of drawing blue on the second draw? (h) Can we 
use (3) to find P[A M B] if either P[A] or P[B] is zero? 


From the previous examples it is clear that the absolute probability 
P[A] may not be equal to the conditional probability P[A|B]. Usually 
the probability of an event changes if the hypotheses (1.e., the given con- 
ditions or sample space) change. However, it may happen that P[A |B] = 
P{A|U] = P[A]. In this case (3) becomes P[A ὦ 8] = P[A]P[B]I. 
When this equation holds we say that A and B are statistically independent. 

Statistical independence does not mean that there is no relation between 
A and B! On the contrary, there is a very special relation, namely that 
P(A ἡ B)/P(B] = P[A]. In terms of relative frequencies, this becomes 
“(αὶ ἡ B)/9t(B) = (4) απ). In words, the relative frequency of 
A in B is the same as the relative frequency of A in the sample space. 
The situation may be visualized by a Venn diagram in which areas are 
proportional to frequencies. In Fig. 8-2, area (A ἢ B)/area (B) = area 
(A)/area (U). This is a very special situation. The proportionality of the 
frequencies means that the occurrence or nonoccurrence of B has no 
effect on the probability of the occurrence of A. 

Statistical independence must not be confused with mutual exclusive- 
ness. If A and B are mutually exclusive, the occurrence of one precludes 
the occurrence of the other, and they are certainly not independent. 

In practice we assume that two events are independent if the occurrence 
of one has no effect on the conditions influencing the occurrence of the 
other. For example, if we throw a die and then throw it again after 
thorough shaking, experience shows that the relative frequencies of the 
faces on the second throw are the same regardless of the outcome of the 
first throw. We say that the throws are independent. The probability of 


FIGuRE 8-2 


- 
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throwing ἃ seven twice in succession with two dice is (1/6)(1/6) or (1/36). 


Find the probability that: (i) a coin comes up heads twice in succession, 
(1) a pair of dice totals seven on the first throw and six on the second, (k) both 
of two cards drawn with replacement from a deck of 52 cards are aces, (1) both 
of two cards drawn from a deck without replacement are aces. 


We should like to extend the above ideas to more than two events. 
For n events, A1, Ag, ... , An, (3) becomes 


(4) PlAy nN A2N...N An] = P[A,]P[Ae|Aji]P[A3|A1 ἡ Ag]... 
P[An|A1N A2N...N Ag]. 


Under what conditions can we replace the conditional probabilities in the 
right member by the corresponding absolute probabilities? At first sight 
we might guess that it would be sufficient for each pair of events to be in- 
dependent. Actually, this pairwise independence is not enough. We 
shall not go into details, but the interested student may consult Chapter 
5 of the book cited in Problem 15 of Section 8-4. We call a set of events 
{A1,..., An} mutually independent if the probability of the intersection 
of any subset is the product of the corresponding absolute probabilities. 


If Ay, Ao,..., An are mutually independent, 
Pi Ay Ἢ Ag sos 1 Ad] = Ρ[44]Ρ[4..]... ΡΙΑ,]. 


(5) 


In practice, we use this formula when we have reason to believe that no 
combination of occurrences or nonoccurrences of any set of the events 
has any effect on the chances of any other event. Of course, this belief 
may be tested by experimental determination of relative frequencies. 

Formula (5) is one of the most frequently used laws in probability and 
statistics. For example, what is the probability that a coin will come up 
heads 10 times in a row? Assuming independence of the throws, the 
answer is 2 19 or 1/1024. What is the probability of the sequence 
HTHTHTHTHT in 10 throws of a coin? The answer is the same, 2. «Ὁ, 
What is the probability that in 10 throws there will be five heads and 
five tails? This result can occur in many mutually exclusive ways, each 
with probability 2~'°. The number of ways is the number of partitions of 
10 objects into two sets of five each, that is, 10!/(5!)*. Hence the prob- 
ability is 2—1?°10!/(5!)?. 


(m) What law from Section 8—2 did we use to get the last result? (n) From 
a box containing nine red and one white ball, ten balls are drawn with replace- 
ment. What is the probability that they are all red? (0) If the probability of 
a plane’s being shot down during a mission is 1/10, is it certain that it will get 
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shot down if it undertakes ten missions? (p) From a box containing five red 
and five white balls, three balls are drawn, each one being replaced if and only 
if it is red. What is the probability that the third ball is red? 


PROBLEMS 


1. What is the probability that a five-card poker hand has no pairs? Solve 
this problem by direct enumeration and by using theorems of this section. 

2. Suppose that the probability that life appears on a planet similar to the 
earth is 10719. Suppose that the probability of a sun similar to ours having a 
planet similar to the earth is 10—°. What is the probability that a sun similar 
to ours has a planet on which life exists? 

3. Suppose that of the students who cheat on examinations, one out of 100 
is caught. What is the probability that a person who cheats ten times gets 
caught? one hundred times? 

4. Suppose the chromosomes of one parent contain one gene for brown eyes 
and one for blue, while those of the other contain two genes for blue eyes. The 
chromosome of the child contains one gene from each parent. The eyes will be 
brown if at least one gene for brown is present. What is the probability that this 
will happen, assuming that either gene from each parent is equally likely? 

5. What is the probability that a child of the parents in Problem 4 has blue 
eyes? that two children in a row have blue eyes? that of four children, all have 
blue eyes? 

6. Answer the questions of Problems 4 and 5 assuming that both parents 
have one gene for blue and one for brown. 

7. Suppose that one-tenth of the pennies in circulation have heads on both 
sides. Suppose that a penny is picked at random and tossed. What is the prob- 
ability that it shows heads? 

8. In the situation.in Problem 7, if a penny is tossed and shows heads, what 
is the probability that it is a two-headed penny? 

9. A die is to be thrown n times. What is the probability that six comes up 
at least once in the n throws? 

10. That four comes up exactly three times? exactly once? 

11. Suppose that ἢ = AU BUC, where A, B, and C are independent. 
Show that P(D) = 1 — P(A’)P(B’)P(C’). 

12. If a head has come up six times in a row on the toss of an evenly balanced 
coin, what is the probability of its coming up again? 

13. The following quotation is from The Criminality of Women, by Otto 
Pollak: “Another type of fraud in which female accomplices are used is the dis- 
position of fake jewelry by the professional pawner. The faker usually buys a 
quantity of rolled trinkets and works a small piece of genuine gold into each. 
The woman helper takes the product to a pawnbroker and offers it for what it 
may be worth, making no statement as to its being real gold. The pawnbroker 
tests it by acid application and, if he happens to hit the piece of real gold, buys 
the whole trinket as such. If he hits another spot, he refuses to buy. According 
to the law of probability, the woman tries other pawnbrokers until she has dis- 
posed of the whole stock.” Explain the last sentence. 
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14. Suppose that the probability of the pawnbroker hitting the gold is 1/3. 
What is the probability that the woman would be successful with the third pawn- 
broker? What is the probability that she would have to visit no more > than 
three pawnbrokers? 

15. Answer the ae aka in Problem 14 if the probability of a SawnbioRee 
hitting gold is only 1/10. 

16. This problem is inne from First Course in Probability and Statistics, 
by J. Neyman. A procedure for diagnosing cancer of the lung consists of three 
steps: 1. Collect a sample of sputum from the patient. 2. Put a small sub- 
sample of this sample on a slide. 3. Examine the slide for cancerous cells. Let 
p1 = probability that sputum from a cancerous patient contains cancerous | 
cells, po = probability that the subsample contains cancerous cells given that 
the sample does, and p3 = probability that cancerous cells will be observed on 
a slide containing some such cells. If this procedure is followed once, what is the 
probability that a cancer will be discovered in a cancerous patient? 

17. What is the probability that it will not be discovered? 

18. What is the probability of correct diagnosis if the procedure is followed 
twice? 

19. Ten times? 

20. Suppose & samples of sputum are taken, m slides made from each sample, 
and each slide examined n times. Assuming that all kmn operations are statis- 
tically independent, find a formula for the probability of a correct diagnosis. 

21. If pi = 0.80, pe = 0.25, p3 = 0.90, how large must k, m, and n be so 
that the probability of correct diagnosis is at least 95%? 

* 22. A coin is tossed until heads appears twice in succession. What is the prob- 
ability that this happens on an even throw? odd throw? 

% 23. Answer the same questions for tails. 

% 24. Answer the same questions when either heads or tails comes up twice in 
succession. 

* 25. Suppose that p is the probability that an individual in a community 
favors a certain verdict in a certain jury-trial case. Find the probability that the 
majority of a 12-man jury picked at random favors this verdict. 


ANSWERS TO EXERCISES 


(a) 1/9, 5/36, 1/18, 2/5. (Ὁ) 1/9, 1/2, 1/9, 2/9. (ὁ) 21/55. (ἃ) 3/22. 
(e) 7/22. (f) 538/110. (g) 57/110. (8) The conditional probability is not 
defined, but obviously P[A MN B] = 0. (i) 1/4. (j) (1/6)(5/36). (k) (1/18)?. 
(1) Events not independent; (1/13)(3/51). (m) (8-2-3). (n) (0.9)!® (0) No, 
the probability of its survival is the same as the probability in Exercise (n). 
(p) Events not independent. There are four mutually exclusive ways this can 
happen, RRR, RWR, WRR, and WWR. The probability is (5/10)(5/10) 
(5/10) + (5/10)(5/10)(5/9) + (5/10)(5/9) (5/9) + (5/10) (4/9) (5/8). 


ANSWERS TO PROBLEMS 
5(13 ΖΑ -- — 10 = 1/10: 1 — (0.99}100 = 0.73. 
ae ye) 0.507. 3. 1 — (99/100) /10; 1 — (0.99) 0.73 
5. 1/2;1/4:1/16. 7. (9/10)(1/2) + 1/10. 9. 1 — (5/6)". 11. P(D) = 
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P(D’) =1— P(A’N Β' NC’). 13. By chance, some pawnbroker is very 
likely to hit the gold. 15. (9/10)2(1/10); 1/10 + (9/10)(1/10) + (9/10)2(1/10). 
17. 1 — pipeps. 19. 1 — (1 — pipeps)?. 20. 1 — (1 — pipeps)*”". 
21. kmn = 16. 


8-6 Discrete distributions. Let v = the sum of the faces when two 
dice are thrown, let f(z) = P[v = x] and F(x) = Plv < x]. The graphs 
of f and F are sketched in Figs. 8-3 and 8-4. Each of these functions 
tells us everything about the probabilities associated with different values 
of v, for f tells us directly the probability that v = 2, since f(x) 1s this 
probability, and F tells us indirectly, for the probability that v = 2 1s 
given by F(x) — F(x — 1), the height of the step in F at x = v. More 
generally Plz, < x < xe] = F(a2) — F(a). If we think of zero prob- 
abilities associated with values of v other than the 11 possibilities, the 
domains of f and F are the real numbers and their ranges are subsets of 
the interval (0 1). 


FiGurE 8-3 FIGurRE 8-4 


Let S be a sample space, let r be a function whose domain is S and 
whose range is a subset of the real numbers, and let v = r(u). Then we 
call v a random variable. Of course, v is just an ordinary variable in every 
way. The term “random” merely indicates that we are using it as the 
dependent variable in a function that maps a sample space into real num- 
bers. In the dice example, S is the set of points in Fig. 8-1, u = (a, ὃ), 
where a is the face on the first die, b is the face on the second, and v = 
a -+ ὃ gives the mapping. 

Now let f(x) = Plv = xz] = the probability that the random variable 
has the value x. We call f the frequency function of v. It gives the prob- 
ability (relative frequency) of each value of v. Let F(x) = Ply < .] = 
the probability that v has a value less than or equal to x. We call F the 
distribution function of v. Sometimes F is called the cumulative distribution 
function. The word “distribution” comes from thinking of a total prob- 
ability of 1 as distributed over the range of v. In these definitions there is 
no limitation on the nature of S nor any requirement that the range of r 
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be finite. In this section we consider only random variables whose ranges 
are finite. The corresponding distributions are naturally called discrete. 
(See Section 6-8, Problem 15.) 


(a) Let a coin be tossed twice. Let heads be represented by 1, tails by 0. 
Letting x = result of first toss, y = result of second, graph the sample space 
of couples of tosses. Letting v = the number of heads in two tosses, find and 
graph the frequency function and distribution function οὗν. (Ὁ) Do the same 
letting » = the number that turns up when one die is thrown. 


We note that for a discrete frequency function 


(1) . | f(x) = 0, 

(2) Σ fe) = 

where the summation extends over the domain of ἢ. 
(3) F(«) = 2. 700, 

(4) ry > 21 > F(xs) > F(z), 
(5) jim F@ =0, lim F@) = 


Any function that satisfies (1) and (2), ie., is non-negative and whose 
values add to one, is the frequency function of some random variable. 
Also, any function that satisfies (4) and (8), .1.6., is monotonic increasing 
from 0 to 1, is the distribution function of some random variable. 

Let g be a function that maps the range of a random variable v into a 
set of real numbers. Then we define the expectation of g(x) by 


(6) Elg(x)] = >) 92s), 


where f is the discrete frequency function of v and the summation extends 
over the domain of f. Suppose, for example, that an individual is gambling 
on the value of v and that g(x) is the payoff he receives corresponding to 
v = x, where g(x) < 0 means that he has to pay out |g(x)|. Then if he 
gambles many times under the same conditions, he may expect to receive 
g(x1) a proportion f(#1) of the time, σία 2) a proportion f(z) of the time, 
and so on for all values of x. On the average, then, he may expect to re- 
ceive E[g(x)]. If he plays 100 times he may expect to receive 100E[g(z)], 
since g(z;)[100f(x;)] is the amount he might expect to win from the oc- 
casions when v = ὅς. 

For example, if two people bet even money on the cise of two dice 
with the understanding that the thrower wins $1 if he throws a seven and 
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pays out the same if he does not, we have g(x) = the payoff to the first 
player when the outcome is x. Then g(7 comes up) = 1 and g(7 does not 
come up) = —l. The corresponding probabilities are 1/6 and 5/6, so 
that the player’s expection is (1/6)(1) + (5/6)(—1) = —2/38. That is, 
he may expect to lose 2/3 of a dollar on the average each time he plays 
this game. 


(c) What should the bets be in this game in order for it to be fair, i.e., so that 
the expectation of each player is zero? (d) Suppose A agrees to pay B an 
amount equal to the sum of the faces of two dice that he throws. What is B’s 
expectation? (e) What should A require B to pay him each time he plays 
this game in order to break even in the long run? 


Imagine an experiment in which there are only two possible outcomes, 
success with probability p, and failure with probability ῳ = 1 — p. 
Suppose that this experiment is repeated n times under identical conditions, 
so that each experiment is independent of the others. Let v = the number 
of successes in the n trials. What is the frequency function of v? We have 
f(x) = the probability of exactly x successes out of n trials. Letting 1 
be a success and 0 a failure, a possible outcome of the trials may be repre- 
sented by a sequence such as O11010111...10. By (8-5-5) the prob- 
ability of any such outcome is p*qg”~*. But there are many different se- 
quences in which z 1’s and (n — x) 0’s appear. Indeed, from Exercise (s) 
in Section 8-4, there are just CZ. Hence we have ΟἿ mutually exclusive 
ways, each with probability p*q”~*, and 


(7) Τὰ) Ἐξ *. 


This distribution 1s known as the binomial distribution. 


(f) Explain the name. 

Sketch the frequency function of a binomial variable for n = 4 and p as fol- 
lows: (g) 0.1, (h) 0.2, (i) 0.8, (j) 0.4, (k) 0.5, (1) 0.6, (m) 0.7, 
(n) 0.8, (0) 0.9. (Keep your results for future reference in Chapter 9.) 


The frequency function or the distribution function of a random vari- 
able (given by a table, graph, or defining formula) tells us everything about 
the situation. However, it is sometimes convenient to find certain numbers 
that are characteristic of a distribution and that suggest its important 
features. Among these the most common is the first moment or mean μ 
(pronounced “mew”), defined as the expectation of x, that is, 


(8) p= Elz] = >> χα). 


The term “moment” arises in the following way. Let us imagine the 
probabilities f(z) as weights located at the points (2, 0) along the z-axis. 
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FIGURE 8-5 


If we place a fulcrum at the origin, the moment of f(x) about it is 2f(z), 
the weight times the lever arm. (See Fig. 8-5.) The sum (8) is the total 
moment of all the weights. The total weight is one by (2). If we place a 
weight of one at a distance (£[z], 0), the system will be in balance. Hence 
E{x] is both the first moment and the average lever arm. | 


(p) Let « = the number of heads that appear when a coin is tossed once. 
Find the frequency function and first moment of x (q) Find the first mo- 
ments of the distributions in Exercises (g) through (0). 


Another frequently used number associated with a distribution is its 
variance defined as the expectation of (x — E[x])?. It is symbolized by 
o” (sigma squared), and σ᾽ is called the standard deviation. We have 


(9) σὲ = Ee — ἡ), = >) @ — Οὐ). 


The variance is simply the weighted average of the squares of the distances 
of each value of x from the mean. It is a measure of the extent to which the 
points are spread out, the greater the variance the greater the average 
distance. Squaring the differences eliminates negatives, which is desirable 
since we are interested in how far away each point is and not in the direc- 
tion. The same result could be accomplished by using E[jxz — ul], called 
the mean deviation, but the variance is more widely used. 


(r) Find the variance of the distribution of Fig. 8-3. (8) Find the mean and 
variance of the distribution in Exercise (a). (Ὁ) Do the same for Exercise (b). 
(u) For Exercise (g). (v) For Exercise (k). 


PROBLEMS 


1. If out of 100 people age 5, 1 dies at age 10, 1 at age 20, 2 at age 30, 2 at 
age 40, 6 at age 50, 11 at age 60, 19 at age 70, 27 at age 80, 23 at age 90, and 
the rest at age 100, what is the expectation of the age at death (life expectancy) 
of a person in the group? What is meant when it is said that the life expectancy 
of a person in the United States is 65 years? 

2. A coin is tossed 10 times. What is the probability that it comes up heads 
five times and tails five times? 

3. If the probability of success of an experiment is 1/100, how many trials 
are necessary to have a probability of 1/2 of getting at least one success? 

4, Show that for the binomial distribution, the first moment μι = np. Why 
is this reasonable? : 
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αὐ. Show that the variance of the binomial distribution is npq. 

6. A distribution is called uniform if the frequency function is a constant 
function. Give an example. What is the appearance of the distribution function? 

7. Suppose that a “wheel of fortune” has 12 numerals, 1 through 12, and is 
equally likely to stop at any one. Let v = the number at which it stops. Sketch 
the frequency function and the distribution function. 

8. In the situation in Problem 7, if the sponsor gives away $1000 if the wheel 
stops at the number chosen by the contestant, what is the expected cost? 

9. Suppose in the above situation, the sponsor merely gives the contestant 
a chance to win $1000 by correctly answering a question on which three out of 
four people fail. What then is the expected cost? 

10. Let » = the number of aces in a five-card hand. Find the frequency func- 
tion and sketch it. Find the first moment and the variance. 
%11. Since the values of f(x), where f is a discrete frequency function, are 
bounded from above, there must be a maximum. If f(x) is this maximum, then 
x is called a most probable value of x. What is (or are) the most probable value 
(or values) of a binomial random variable? 
% 12. The formula in Problem 16 of Section 8-4, where nj, n, and r are fixed, 
is a defining formula for a frequency function called the hypergeometric distribu- 
tien. Sketch it forn = ὅ, πὶ = 2,r = 3. Find its first moment and variance. 
13. Let v be the number of hearts in a 13-card hand from a full 52-card deck. 

Write the frequency function of v and sketch it. What is the first moment? What 
is the most probable value or values? 


*% Poisson distribution. Imagine a sequence of trials to which the binomial 
distribution applies, and suppose that the number n of trials is large but the 
probability p is small, so that their product np is of moderate size. The com- 
binatorial coefficient in (7) is difficult to compute. Under these conditions it 
can be proved (see Feller’s Probability Theory and [is Applications or Neyman’s 
First Course in Probability and Statistics) that the following approximation is a 
good one. 


(10) Crip q -Ξ (ἀρ) ε΄ αὶ. 


More precisely, it is true that if n - © and p -- 0 in such a way that np re- 
mains fixed, then the left member of (10) approaches e~"?(np)*/x! Letting 
\ = np, we have 

—A, z 


᾽ 


(11) pe) = — 


which is called the Poisson distribution. Its first moment is X. 

As an example of the use of this approximation, consider the probability of the 
appearance of five cases of a disease in a population of 100,000,000 where the 
probability of getting this disease is 1/50,000,000. The left member of (10) 
would be unmanageable! But here ἃ = 2, and the approximation gives us 
e—225/5!, or about 0.037. 
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14. In this example, find the approximate probabilities of the appearance of 
0, 1, 2, 3, 4, 5, 6 cases and graph your results. 

In practical problems when we are faced with an event that occurs rather 
rarely (p small) in many trials (n large), we take the observed average number 
of occurrences to be an estimate of A. Then we use p(x) as an estimate of the 
probability of z occurrences. Typically we have an event that occurs ἃ times on 
the average during a time interval, and we wish the probabilities of different 
numbers of occurrences during the time interval. 

15. Suppose a radioactive substance emits an average of four particles per 
unit time. What is the probability that it will emit none, 1, 2, 3, 4, and 5 parti- 
cles in a particular unit time interval? (Here we can imagine each of the n par- 
ticles it could emit as a trial.) Graph your results. 

16. Suppose that the average number of bacteria per unit area on a plate is 
three. What is the probability that a particular unit area contains bacteria? 
What does this mean in terms of the number of unit areas that may be expected 
to contain no bacteria? 


ANSWERS TO EXERCISES 


(a) {(1,0), (0, 1), (0,0), (1, 1}; f0) = Κῶ) = 1/4, fC) = 1/2. (b) {1, 2, 
3,4, 5,6};f = 1/6. (c) The thrower should pay $1 when he loses and receive 
$5 when he wins. (4) 7. (e) 87. (ὦ) The distribution is given by (p + q)”. 
(p) f() = f(1) = 1/2,u = 0(1/2) + 11/2) = 1/2. (q) 0.4, 0.8, 1.2, 1.6, 2, 
2.4, 2.8, 8.2, 3.6. (r) 85/6. (s) 1,1/2. (Ὁ 7/2, 85/12. (u) 0.36. (v) 1. 


ANSWERS TO PROBLEMS 


1. 74.9. 2. 63/256. 38. Probability of at least one success is 1 — (0.99)”. 
For this >1/2, (0.99)" < 0.5, nlog 0.99 < log 0.5, or n > log 0.5/log 0.99. 
(Note change of direction because log 0.99 < 0.) Then n > 75. 7. Uniform, 
f = 1/12. 9. About $21. 

13. f(z) = Cl8C33__/C$2. 15. e~447/x!. 16. 1 — e~%. We may expect 
about 2% to contain no bacteria. 


8-7 Continuous distributions. Consider, as suggested in Fig. 8-6, a 
pointer free to rotate about its center and balanced so that it is as likely 
to come to rest at one point as at another. It is easy to see that the prob- 
ability that it comes to rest at any particular point is zero. For, suppose 
that the probability of a point is € > 0. There are more than 1/e points, 
indeed there are an infinite number of points. Hence the total prob- 
ability would be more than (1/e)e, i.e., more than 1, which contradicts 
(8-2-8). 

To deal with the situation we must consider the probability that the 
pointer comes to rest in some interval. Since we naturally feel that the 
pointer is twice as likely to come to rest in an interval twice as long, we 
are led to define the probability of an interval as the length of the in- 
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terval divided by the total length of the circumference, that is, P[J] = 
L(1)/L(U), where U is the sample space. Note the analogy of this with 
(8-3-1). In each case we take the probability of a set as a measure of the 
set divided by the same measure of the sample space. 

In the case of the rotating pointer, L(U) is the length of the circumfer- 
ence. If we choose units so that this length is 1, designate a point on the 
circumference as origin, and let v = the shortest clockwise distance on 
the circumference from the origin to the point of rest, then P[v < z] = 2. 
The cumulative distribution function of v 1s as sketched in Fig. 8-7. 

Let %2 > 21, then F(x,) — F(x,) 1s the probability of the interval 
(σι 2); and F(x2) — (αι) (ας — 2), the slope of F, is the prob- 
ability per unit length. If we think of the probability as a unit mass dis- 
tributed over the interval (0 1), this slope is the mass per unit length, 
1.6., the density. Let f(x) = the slope of F at (#, F(x)). In this case 
f(x) is 1in (0 1), undefined at 0 and 1, and 0 elsewhere. It is sketched in 
Fig. 8-8. 


(a) Suppose that the circumference of the dial is 2. Sketch Fandf. (b) Deo 
the same for circumference 1/3. (6) Do the same for circumference 6. 


To generalize the above example let us imagine a random variable v 
with a distribution function F that is continuous and that has a derivative 
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FIGURE 8-9 


everywhere except possibly at a finite number of points. Since it must 
satisfy (8-6-4) and (8-6-5) it will have an appearance something like 
Fig. 8-9. We call DF the density function, designate it by f, and call 
f(x) the probability density. Where probability is more concentrated, F 
is steep and f(x) is large. Where F is horizontal, f(x) is zero. When f(z) 
is constant over an interval and zero elsewhere, we call the distribution 
uniform. Figure 8-8 is typical of a uniform distribution. 


(4) Is a distribution function F’ necessarily steepest at x = 0? If it is, what 
can one say of the density function f at x = 0? (ὁ) What is the appearance 
of f if F is symmetric about the point where it crosses the y-axis? (ἢ Show 
that a continuous distribution function F must satisfy (8-6-4) and (8-6-5). 
(6) Show that a density function f satisfied (8-6-1). 


To get theorems corresponding to (8-6-2) and (8-6-3), we must re- 
place the summations by integrals. For, since DF = f, 


(1) Ply <0 < ea] = Flos) — Fm) = fF. 


That is, the probability over any interval is the integral of the density 
function over that interval. Letting F(—o) = lime. F(x), F(«) = 
lim, _,. F(x), and defining 


ae at 
oy Dee frm fire ΥΣ 
we have | 


(4) F(x) = fe: ie 
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which corresponds to (8-6-3), and 
(5) f fa 


which corresponds to (8-6-2). We use —o and o as limits of integration 
because we wish to leave open the possibility that the density may not 
be zero no matter how far out we go. Of course, if f(z) = 0 outside some 
interval (a δ), then we may change —o to a and οὐ to ὃ in the above 
formulas. 


(h) Explain carefully the justification of (1). () Derive (4) from (1) through 
(3). (Ὁ Do the same for (5). (k) Verify that (1), (4), and (5) hold for the 
distribution of Fig. 8-7. 

Sketch each of the following density functions, find and sketch the distribution 
function, and verify that (5) holds: (1) f(z) = 2x in (0 1), zero elsewhere; 
(m) f(x) = 1/2 in (1_}), zero elsewhere; %*(n) f(z) = e~* for x > 0, zero 
elsewhere. 


In strict analogy to (8-6-8) and (8-6-9), we define the mean and 
variance of a continuous distribution by 


(6) = [. af(x) dx, 
(7) ? = i ( — μ) 1) de. 


(o) Find the mean and variance of the distribution of Fig. 8-7. (Ὁ) Do 
the same for the distribution of Exercise (4). (α) Do the same for Exercise (0). 


PROBLEMS 


1. We may regard Fig. 5-19 as giving the distribution function for which 
F(x) = the probability that the pitch x will be discriminated = the probability 
that the minimum pitch discriminated is less than or equal to x. Graph the 
density function f. What is f(z)? Find the mean and variance. 

2. Argue that if f is the probability density, then f(z) Az gives approximately 
the probability of the interval (x w2-+ Az). 

3. Suppose Μ᾽ (2) = the probability that an individual dies at or before the 
age of z, What would you expect the graph of F to look ike? Sketch a plausible 
graph. Sketch the corresponding f. What is f(x)? | 

4, The expectation of g(z), where x has the continuous probability density 
f(x), is defined by 


(8) Big(@)) = fof. 


Argue for the reasonableness of this and its consistency with (8-6-6). 
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5. Show that (6) follows from the first equation in (8-6-8) and from (8) 
above. 

6. Derive (7) similarly. 

*7. Show that if ce~’ is to be a probability density function in the interval 

(0 οοὐ, thene = 1. 

8. Suppose f(z) = εἴου x € (a__b) and f(x) = Oforxz ¢ (a_b). Find the 
value of c so that f is a density function. 

9. Find the first moment and variance for the distribution in Problem 8. 

10. Sketch the distribution function corresponding to f of Problem 7. 

11. Determine c so that ἢ, defined by f(z) = c(mx-+ δ) for x Ε (0 10) and 
f(z) = 0 otherwise, is a density function. Sketch it and the corresponding dis- 
tribution function form = —1, ὃ = 2. 

12. What is the expected value of x in the pointer problem? 

13. Suppose a point is chosen at random inside a circle of radius 2. What is 
the probability that it lies inside the concentric circle of radius 1? 

14. To answer the question in Problem 13 we need to extend the definition 
of probability to cover areas, since the sample space is a plane area and the 
events are subsets of it. Suggest a definition for probability of a subset of a plane 
area on the assumption that all points are equally likely. 

15. With the definition chosen in Problem 14, what is the probability that a 
point chosen at random in a unit circle lies at the center? 

16. Lies on a specified diameter? 

17. Lies in a region of unit area? 

* 18. A board is ruled with a series of equidistant parallel lines. A needle whose 
length is less than the distance between the lines is thrown at random on the 
board. Letting d be the length of the needle and ἢ the distance between lines, 
show that the probability that the needle intersects a line is 2d/mh. (Since the 
probability should be approximated by the relative frequency in many trials, 
this formula can be used to make an experimental determination of +!. The 
idea is due to the Comte de Buffon (1707-1788), and the problem is called 
Buffon’s needle problem. See Introduction to Mathematical Probability, by 
J. V. Uspensky, p. 112, and elsewhere.) 

% 19. Determine a so that F(x) = 1 — az? is a distribution function. Find 
f. Graph F and f for ὃ = 1.5. (This is the Pareto distribution, in which F(z) 
is the proportion of the population having an income of z or less. See The Theory 
of Econometrics, by H. T. Davis, p. 23.) 

* 20. Show that if a measure 117 is defined for subsets of a sample space U and 
probability is defined by P(A) = M(A)/M(U), then (8-2-1) through (8-2-4) 
are satisfied. 


ANSWERS TO EXERCISES 


(a) F(x) = 2/2for0 < x < 2, F(x) = Oelsewhere. f(x) = 1/2 in the same 
interval, Qelsewhere. (Ὁ) F(x) = 3xin0 < x < 1/3,f(z) = 3. (c) F(@%) = 
ajeinO<2x<e, f(x) =1/enm0<2<ec. (d) No. Has its maximum. 
(e) f is symmetric about the line x = 0. (f) Since F(x2) — F(a1) = σι < 
v < zoel, F(x2) — F(x1) > 0 by (8-2-2). This implies (8-6-4). Also, since 
F(z) = the probability that v < x, and since the probability is 1 that v takes 
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some value in (—o% οοἹὐ, (8-6-5) follows. Note that these arguments are quite 
independent of whether the distribution is discrete or continuous. 

(g) Since F(x) is nowhere decreasing by (8-6-4), its derivative f(x) is nowhere 
negative. (8) The first equation comes from the definition of F(z). We have 
P(x. «υ = x2) = P(v < x2) = Pv < 21) = F (2x2) 7 F (21). The first equa~ 
tion here follows from (—% 21) U (αι 22) = (—®__x2). (4) In (1) let 
v1 approach —®. (k) Note that in this case the integrals have finite limits. 
(1) through (n) Strictly speaking f is undefined at endpoints. (0) 1/2, 1/12. 
(p) 1, 1/3. (q) ¢/2, 2/12. 


ANSWERS TO PROBLEMS 


1. f(z) Ax is approximately the probability that a pitch in the interval 
(x «+ Az) will first be discriminated. 3.5, 1/3. 3. Starts at origin, rises 
steeply for very short interval (infant mortality), then rises rather slowly and 
steadily until a more rapid rise after age 50, approaches 1 after age 80. Same 
general shape as first graph in Fig. 8-9. 7. Integrate ce? in the interval 
from 0 to ὃ and then let ὃ -- ©. 9. a+ (ὃ — a)/2, (ὃ — a)?/12. 11. (50m+ 
100) “1. 13. 1/4. 14. Area of set over area of space. 15. Zero. 16. Zero. 
17. 17π. 


8-8 The normal distribution. The most useful of all continuous dis- 
tributions is the normal distribution whose density function g is given by 


1(5.::- 2 


— 1 oe σ 
() g(x) = my aed 


over the interval (—o__o). This function obviously satisfies (8-6-1), 
and it also satisfies (8-7-5), though the proof is too advanced to be given 
here. The parameters » and o are respectively the mean and standard 
deviation of the distribution. 

Since we can always make the mean zero by choosing it as the origin 
for our measurements, and since we can make σ᾽ = 1 by choosing an ap- 
propriate unit of measurement, we work with the standard normal dis- 
tribution defined by 


(2) Ἰ (en) fe, 
The relation between x and z is 
(3) z2=(4— p)/o or © =oz+ μ. 


Table VI in the Appendix gives f(z) for selected values of z. Since 
the function is even, we do not need a separate tabulation for z < 0. 
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Figure 8-10 


Figure 8-10 shows the standard normal density function. Since 
(4) Df(x) = f@) - (—2), 


by (7-14-12) and (7-7-1), we see that Df(z) = 0 only at z = 0. Hence 
the curve has no other maxima or minima. Since 


(5) D’f(z) = fz) - (--1) + f@) - (—2)? = f@@?* — 1), 


the curve has inflections at (41, (27e)~'/"). It evidently approaches the 
z-axis asymptotically. 

The more general normal distribution (1) is similar, with maximum at 
x = μ and inflections atz = wig. 

To find the probability that a normally distributed random variable 
lies in an interval (a__b) we need to evaluate the integral 


(6) F(z) = is (Ὡπ)- 1,3. 112 


We cannot do this by finding a simple formula for the antiderivative, 
since there is no elementary function whose derivative is the integrand. 
(An elementary function is one constructible from the constant functions, 
I, οἷ, loge, and the trigonometric function, and their inverses by the opera- 
tions of addition, subtraction, multiplication, division, root extraction, 
and composition.) However, the integral (6) can be approximated by 
various means. It turns out to be most convenient to work with the 
integral 


ἐν H(z) = F(z) — F(0) = iA (πὴ) - 1,3. 1 13 


Table VI gives H(z) for selected values of z. Since F(0) = 0.5, we have 
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F(z) = H(z) + 0.5forz > 0. Forz < 0, wehave F(z) = 1 — F(z) = 
0.5 — H(—z). 

(a) Why does F(0) = 0.5. (Ὁ) Why does F(z) = 1 — F(—z) for z < 0? 
(c) Justify (7). 


By making use of the Table VI and (8-7-1) we can find the probability 
that any normally distributed variable lies in a certain interval. Tor 
example, what is the probability that a variable following the standard 
normal distribution has a value less than 1? From Table VI, H(i) = 
0.341; hence F(1) = 0.841. The probability that it is greater than 1 is 
therefore 1 — 0.841 = 0.159. What is the probability that 1t deviates 
from the origin by less than I, i.e., that —1 < v < 1? Clearly the prob- 
ability is twice that for the interval (0 1), that is, 2H(1), or 0.682. The 
probability that it deviates more than 1 is therefore 0.318. In general, 
Ρ[--α < v < a] = 2H(a). It is because probabilities of this kind are 
often needed that H(z) is tabulated instead of F(z). 


What is the probability that a standard normal random variable is: 
(4) greater than 2? (6) less than 2? (f) greater than —2? (g) less than 
—2?  (h) deviates from the origin by less than 2? (1) deviates from the 
origin by more than 2? () lies in the interval (1__2)? 


By using (3) we can answer similar questions for a normal random 
variable with mean μ and variance σ΄. If x is given, we find the cor- 
responding z from (3) and use Table VI as before. For example, what is the 
probability that a normal random variable deviates from the mean by less 
than the standard deviation? This deviation means that 7 ~ μ =o, 
hence αὶ = 1, and we have from above the probability 0.682. This means 
that if we sample from a population following the normal distribution, we 
expect deviations from the mean of less than the standard deviation about 
2/3 of the time. 


What is the probability that a normal random variable differs from the mean 
by: (k) lessthantwiceg? (1) less than 30? (m) morethang? (n) more 
than 2c? (0) more than 3c? 

Assume that the weights of people are normally distributed, so that the 
actual weights of a set of individuals may be viewed as a random sample from 
an infinite normal population. Suppose that the average weight is 175 lb and 
the standard deviation is 25. Out of 1000 people, how many would you expect 
to have weights: (p) between 150 and 200? (q) between 125 and 225? 
(r) more than 225? (8) less than 125? 

(t) Suppose that an instructor of a large class grades “on the curve” by 
finding the mean yw and standard deviation σ᾽ of all numerical grades, dividing 
the interval (---ο ©) into five intervals at the points μ — 1.50, μ — 0.50, 
uw + 0.50, and uw -+ 1.50, then assigning E, Ὁ, C, B, or A according to the inter- 
val in which the numerical grade lies. About what percentages of students get 
each grade? 
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(u) Complete the following: In a population that is normally distributed, 
about __ percent of the members lie within one standard deviation of the mean, 
___ percent within two standard deviations, and __ percent within three standard 
deviations. 

PROBLEMS 


1. Show that g(x) and f(z) satisfy (8-6-1). 

2. Find f(0). 

3. Sketch (1) for μ = 2,¢ = 3. Show the points of inflection. 

4, Suppose »v is distributed normally. Let & be a number such that P(u — 
Esv<spt+ ΕἸ = 1/2. Find £ in terms of μ andc. 

5. When measurements of a physical entity are made, it often happens that 
they are distributed approximately normally with uw equal to the true value. 
Supposing this to be the case, what percentage of the measurements would be 
expected to fall within 0.67¢ of the true value? 

* 6. Evaluate fi" f from the table for H(z). Evaluate it, using (7-12-10), 
and compare. 

7. Make a table showing the probabilities that a normally distributed vari- 
able lies more than 1, 2, 3, 4 standard deviations from the mean. 

8. Suppose 2500 students are tested on an examination in which the mean is 
500 and the standard deviation 100. Assuming normality, how many might be 
expected to score 750 or over? 


Problems 9 through 14 refer to the examination described in Problem 8. 
Find the expected percentage who score. 


9. Between 400 and 600. 10. Less than 400. 11. Less than 300. 
12. Over 600. 13. Over 790. 14. Over 799. 


*15. A manufacturer guarantees batteries to last 24 months. He wishes less 
than 10% of his batteries to last more than 30 months, but he wants to cut to a 
minimum the number that fail before 24 months. Assuming the mean life to be 
24, what o should be the engineer’s aim in design? 

* 16. In the same situation, suppose the manufacturer wishes to have less than 
1% of the batteries fail before 18 months and wishes to minimize the number 
that last more than 25 months. If u = 24, for what o should the engineer aim? 
* 17. If the engineer may choose both » and o, what should be his choice in 
Problem 16? 

ANSWERS TO EXERCISES 


(a) The graph of f is symmetric with respect to x = 0, since f(—z) = f(z). 
Since the whole area is 1, F(0) = 0.5. (0) F(z) = Plo “ 2] = 1— 
Ρίυ > 2] = 1 — Plo < --2]. (ce) (8-7-1). (d) 0.023. (e) 0.977. (ἢ) 0.977. 
(6) 0.023. (h) 0.954. (i) 0.046. (j) 0.977 — 0.742 = 0.235. (k) 0.954. 
(1) 0.998. (m) 0.318. (n) 0.046. (0) 0.002. (p) 682. (q) 954. (τ) 23. 
(s) 23. (Ὁ) 7; 24; 38; 24; 7. (u) 68, 95, 99.8. 


ANSWERS TO PROBLEMS 


1: Since γα δ᾽ > 0. 3. Max at (2, 1/38V 27). Inflections at x = -—-1 and 
x = 5. 5. About 50%. 7. 31.7%, 4.6%, 0.27%. 9. 68%. 11. 2.38%. 13. 0.2%. 
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8-9 Laws of large numbers. From the beginning (Section 8-1) we have 
viewed probability as a number that gives an expected relative frequency 
in the long run. If the probability is p, we expect the event to occur about 
np times in n independent trials. Indeed, if this does not happen, we usually 
conclude that the probability is not correctly calculated. The kind of be- 
havior we expect is illustrated in Fig. 8-11, which gives the observed 
proportion x/n of successes x in n independent trials in each of which the 
probability is 1/2. We note that as n gets larger, x/n seems to tend toward 
1/2, though there is considerable oscillation. The graph was constructed 
from Table V in the Appendix, which gives a record of successes (1) and 
failures (0) in 2500 trials. If we think of 1 as heads and 0 as tails, reading 
along successive lines in Table V yields the sort of experimental results 
that would be obtained by tossing a perfectly balanced coin 2500 times. 


(a) Make a chart similar to Fig. 8-11 for line 2 in Table V. 


It is because of results such as those indicated in Fig. 8-11 that we find 
probability theory very practical. It enables us to predict the relative 
frequency of events with a margin of error that tends to become smaller 


τη 


0.4 


10 20 30 40 60 80100 200 300 8500 1000 2000 


Fig. 8-11. Proportion z/n of successes in n independent trials with p = 0.5, 
n = 1,2,..., 2500. [Logarithmic scale for n used to contract graph horizontally; 
i.e., the point corresponding to (n, x/n) is plotted at (logion, z/n).] 
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as the number of observations gets larger. Of course, we cannot go so 
far as to say that the relative frequency always approaches the probability 
as the number of trials gets large. If 2 is the number of successes in n 
independent trials with probability p, we cannot assert that χη -- p as 
n— «©. It is quite possible, though very unlikely, that in some particular 
experiment 2/n will not approach p at all. However, from the theory of 
probability we can prove that the probability that 2/n differs from p 
by less than any quantity, however small, approaches one as n approaches 
infinity. Such theorems, which give a precise form to our intuitive notion 
of the relation between probability and the results of repeated experi- 
ments in the long run, are called laws of large numbers. 

The following theorem, due to the nineteenth century Russian mathe- 
matician P. L. Chebyshev, gives insight into the meaning of the variance 
of a distribution and is useful in proving laws of large numbers. 

(1) Let v be a random variable with mean uw and variance a%. 
Then for any t > 0, Ρ[[υ — p| > ἢ < 0? /t?. 


The proof is easy. In (8-6-9), a” is given as a sum over all values of z. 
Since all terms are positive, we have 


(2) σ Ὁ >) @— ) 7), 


where the sum is over just those values of x such that [2 — μ|Ἅ > t. Since 
each squared term in the right member of (2) is greater than or equal to ἐ, 
we have 0? > Yit?f(xz) = UL f(x) = Pilz — μι > ἢ. Dividing both 
members of the last result by ¢? gives (1). 


(Ὁ) Explain each step in this proof. (c) Our proof was for discrete dis- 
tributions. Give a similar proof for continuous distributions. 


Chebyshev’s inequality tells us that the probability that a random 
variable differs from its mean by more than any number 15 less than the 
variance divided by the square of this number. The larger we make the 
number, the smaller is the probability. Roughly speaking, the variable 
is likely to be near its mean and very unlikely to be very far from it. 
More specifically, substituting (ἐσ), we have 


(3) Pllv — μὶ = no] < 1/n’. 


For example, the probability that a random variable differs from its mean 
by more than twice the standard deviation is less than 1/4. 


(d) What can we conclude from (3) about the probability that a random 
variable differs from its mean by more than three times the standard deviation? 
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(e) How does this compare with the probability that a normally distributed 
variable differs from its mean by more than three? Why the difference? 


From Chebyshev’s inequality we can derive the following theorem, 
which is sometimes called the law of large numbers. | 


Let x be the number of successes in n independent trials in 


(4) each of which the probability of success 1s p. Then 


lim P[ix/n — pl > εἸ = 0. 

Roe 
In words, the probability that the relative frequency differs from the 
probability of success by more than any amount, however small, approaches 
zero as n approaches infinity. An equivalent statement is that the prob- 
ability that 2/n differs from p by not more than any quantity, however 
small, approaches one as n approaches infinity. 


(f) Symbolize the last statement. (g) Derive it from (4). 


To prove (4) we note that x is a binomial variable and hence has mean 
np and variance npg (see Problems 4 and 5 in Section 8-6). Hence, 
from (1), 


(5) Plijx — np| > ἢ] < npq/t? 


holds for any t > 0. The substitution (¢:ne) yields P[|x — np| > ne] < 
npq/n*e?, which is equivalent to P{la/n — p| > €] < pg/ne?. Since the 
right member approaches zero as n — oo, the proof 1s complete. 


(h) Can we say from (4) that jz — πρὶ is likely to be small for large n? 
(i) Explain in detail the steps in this proof. (7) Relate (4) to Fig. 8-11. 


The law of large numbers (4) applies only to the binomial distribution, 
but it can be generalized. Suppose that v is a random variable with mean 
u. Suppose that we make n independent observations of v, %1, %2,... , Xn. 
Let = (a, + a@.+...+2,)/n. Then the probability that Ζ differs 
from » by more than any ε, however small, approaches zero as n ap- 
proaches infinity, 1.e., 

(6) lim P[|¥ — p| > e] = 0. 

This law is sometimes called the weak law of large numbers. We may view 
the n independent observations of v as the result of sampling with replace- 
ment from a population consisting of the values of v, the probability of 
choosing the values being given by the frequency function. Then (6) 
means that when we sample from a population, the mean of the sample 
may be expected to approximate the mean of the population, with the 
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probability of a deviation of any given size approaching zero as the sam- 
ple size increases. It is interesting to note that for (6) to hold all that is 
required is the existence of the mean. Granted this, the law holds regard- 
less of the nature of the distribution. The proof of (6) and the remaining 
theorems of this section are beyond the scope of this book. 

There are many laws of large numbers. One of them, known as the 
strong law of large numbers, asserts that 


(7) Pllim |z — p| = 0] = 1. 


hoe 


(k) Explain the difference between (7) and (6). (1) Relate (7) to Fig. 8-11. 


In a very large number of cases we can be much more specific about the 
nature of the distribution of the mean of a sample. Suppose that 24, 15, 
..., Xn are n independent observations of a random variable with mean μ 
and variance g”. Then asn — οὐ, 


vo bh 


« Ρ] -Ὁ F(b) -- 
(8) Pla < a | (6) — F(a), 


where Z is defined as above in (6) and F is the standard normal distribution 
function (8-8-6). 

It is this theorem and its generalizations that make the normal dis- 
tribution so valuable. No matter what the original distribution (provided 
it has a mean and variance), the mean of a sample tends to follow the 
normal distribution as the sample size is increased. Since probabilities 
associated with normal distributions are rather easily calculated, as we 
have seen in Section 8-8, it is much easier to deal with normal distributions 
than with the many different, and often unknown, distributions that may 
be encountered. But (8) tells us that if we are dealing with the mean of a 
large sample, we may use the normal distribution almost without regard 
to the original distribution. 

To illustrate (8) we consider a random variable with two values 0 and 1 
each with probability 1/2. Here » = 1: (1/2) + 0- (1/2) = 1/2, and 
o? = (1/2)?(1/2) + (—1/2)7(1/2) = 1/4. Now ζὶ = x/n, where z is the 
number of successes in n trials. Hence (Ζ — p)/(o/\/n) = (2/n — 
1/2)(2\/n). This expression should, according to (8), be approximately 
normally distributed with mean 0 and variance 1 for large n. 

Now we may view each half-line in Table V as the record of 25 inde- 
pendent trials, and the whole table as the record of 100 such experiments. 
Here n = 25 and z is the number of 1’s in each half-line. Corresponding 
to each x we have a value of z given by z = (x/25 — 1/2)(2\/25) or 
(x — 12.5)(0.4). In the hundred half-lines we count the frequency of 
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Relative Cumulative 


Frequency frequents eae F(z -+ 0.2) 


oqooeoococooooc”ntooococ co ©] 
mPreoonooocrcoocrodcocroeoonrec ©} 


6 
.2 
8 
4 
0 
6 
2 
0.2 
6 
0 
A 
8 
ἣν 
6 
.O 


οὐ eR © 


each value of z in order to obtain the above table, in which we have 
omitted the values for x < 5, since all entries are zero. The last column 
gives the cumulative probability for the standard normal distribution 
obtained from Table VI. These cumulative probabilities should cor- 
respond to the cumulative relative frequencies observed, if the distri- 
bution of x is to be approximately normal. We use F(z - 0.2) in order 
to get the cumulative probabilities corresponding to the upper end of 
each interval of values z. Since the successive values of z differ by 0.4, 
the point halfway between each pair of values is found by adding 0.2 to 
the smaller one. 

We note from the table that even for n as small as 25 and for only 100 
observations, the mean does appear to be distributed approximately 
normally. 

(m) Plot the last two columns against x on the same graph and connect each 
by a smooth curve. 

A special case of (8), known as the Laplace limit theorem is obtained by 
considering a random variable that takes only the values 1 and 0 with 
probability » and q respectively. Here » = 1-p+0-q=p, σΞ = 
(1 — p)?-p+ (0 — p)?+q = opt p’a = pda + }) = pg, and F = 
z/n, where x is the number of successes in 7 trials. Then 


(9) Ep ep fee 


o//n WV opq/n Vnpy 
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Applying (8), we have 


If x is the number of successes in n independent trials with 
constant probability p of success and ᾳ of failure, thenasn — οὐ 


(10) pla - 53:3 <5] Fe — Fe, 
V npg 


where F is the standard normal distribution function. 


That is, the normalized binomial distribution approaches the standard 
normal distribution as the number of trials increases. The word “normal- 
ized” refers to subtraction of the mean np and division by the standard 
deviation ~/npq. 

The Laplace theorem enables us to use the right member of (10) as 
a convenient approximation to the left when n is large. For example, 
what is the probability of observing between 45 and 55 heads if an evenly 
balanced coin is tossed 100 times? Here n = 100, p= q = 1/2. Itis 
convenient to rewrite the inequalities in (10) as aV/npg-+np <2 < 
b\/npg + np. Here \/npq = 5, np = 50, δα + 50 = 45, 5b + 50 = 55, 
a= —1,b= 1. From Table VI, F(b) = 0.841, F(a) = 0.159, and the 
desired probability is 0.682. 


For the same situation, estimate the probability that the number of successes 
lies between; (n) 49 and 51, (0) 45 and 50, (ρ) 42 and 33. 


PROBLEMS 


1. Figure 8-11 was made by counting the number of successes (1’s) in Table 
V starting at the left of the first row and continuing in the usual reading order. 
It was necessary to calculate 2500 relative frequencies and plot them. Calcu- 
late the first 50 relative frequencies starting on the right of the last row and 
reading backward. Plot your results. 

2. Reading the table backward as in Problem 1, find 2/n for n = 10, 20, 
30, 40, 50, 60, 70, 80, 90, 100, and plot your results. 

3. Derive (4) from (6) by considering the binomial distribution as generated 
in the following way. Let v be a random variable having just the values 1 
(success) and 0 (failure), with probabilities p and g. Then take n observations 
of v and apply (6). 

4. Show that with the same hypothesis as (10), P[zy < x < 22] — F(b) — 
F(a), where a = (αι — np)/Wnpq and ὃ = (x2 — np)/Vnpq. 

5. A die is tossed 180 times. What is the probability that the face six ap- 
pears exactly 30 times? (Suggestion: Let αὶ = 29.5 and x2 = 30.5, and use 
the previous problem.) 

6. Let x be the number of successes in a whole line of Table V. Make a table 
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of the observed frequency distribution of x and compare it with the normal 
distribution as we did in the table following (8). 

7. Do the same for z the number of successes in each group of 10 digits in 
Table V. 

8. A student pilot named Brom passed the time by tossing 10 nickels 40 
times. Letting x be the number of heads appearing in a toss, the observed 
distribution was as follows: 


Frequency 
x 


Compare this with what might be expected by finding cumulative relative 
frequencies and comparing them with those for the normal distribution as we 
did in the section. Do you think that the nickels were biased toward heads? 
9. A die was tossed 100 times. Six came up 25 times. What is the probability 

that for a balanced die, six would come up 25 or more times out of 100? 

10. Two dice were tossed 50 times. Seven appeared only five times. What is 
the probability that seven would appear five or fewer times if the dice were fair? 

11. In order to discover the average height of 1000 students, a random sam- 
ple of 16 students was drawn. Suppose the true average height to be 5.4 ft and 
the standard deviation of the heights to be 0.4 ft. What is the probability that 
the mean of the sample will differ from the true mean by more than 0. 1ft? 

12. Answer the same question for a sample of 25. 

13. For a sample of 100. 

14. Answer the questions for Problems 11, 12, and 138 without the informa- 
tion about the true average height. 

15. Referring again to Problem 11, how ihe one guess an approximation 
to the standard deviation? 
%* 16. For an explanation of the fact that the graph in Fig. 8-11 remains on 
one side of the center line for long periods, see Feller, An Introduction to Prob- 
ability Theory and Its Applications, Vol. 1, Second Edition, Chapter ITI. 


ANSWERS TO EXERCISES 


(a) The first few relative frequencies are 0, 1/2, 2/3, 3/4, 4/5, 5/6, 6/7, 7/8, 
7/9, 8/10, 9/11, 9/12, 9/18, 9/14, 9/15, 9/16. Note that the observed relative 
frequency may not always move toward the expected. The last relative fre- 
quency is 29/50. (Ὁ) Each squared term is >¢ since we limited the sum to 
such values. The sum of the probabilities over such values is Just the prob- 
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ability that [x — yw] >t. (c) The sum in (2) is replaced by an integral from 
uw —ttou-+ t. The rest of the proof is the same with summations replaced by 
integrals. 

(d) It must be less than or equal to 1/9. (6) For a normal distribution the 
probability is 0.002. The normal distribution is very special, whereas 
Chebyshev’s inequality applies to any distribution with mean and variance. 
(Ὁ Ve Pllz/n — p] <e] τ lasn—o. (6) (8-2-6). (8) No, it is likely 
to be very large. (i) |x — np| > ne |x/n — Ὁ}, De sincen > 0. (ἢ If 
we made many such charts and picked an e, we would find that the proportion 
in which 2/n differed from u by more than this ε would be very small for large n. 

(k) The operations of taking the limit and the probability are interchanged. 
The first tells us merely that the probability of any difference approaches 0, 
the second that the probability that the difference approaches 0 is 1. The weak 
law asserts merely that for all n sufficiently large, the probability of any given 
deviation is arbitrarily small. However, larger deviations are still possible, no 
matter how large n is, and hence it does not follow that 7 > uw 85 ἢ -- o, 
The strong law asserts that Ζ — uw almost always, 1.e., with the probability 1. 
Its failure is less likely than that a monkey would type out Webster’s unabridged 
dictionary without a mistake if it were given a typewriter! The strong law 
shows that the theory corresponds to experience and to our intuitive idea of 
probability. In terms of the experiment with which this section begins, the 
weak law asserts that if we draw horizontal lines at a distance ε above and be- 
low the 0.5 line and imagine the experiment continued indefinitely, we expect 
the graph to cross these lines less and less often as n gets large. The strong law 
asserts that in practically all such experiments, the graph approaches the 0.5 
line asymptotically. | 

(1) If we made many such charts, in virtually all of them % would approach 
wasn— oo, (m) The curves cross between 10 and11. (n) 0.158. (0) 0.341. 
(p) 0.671. 


ANSWERS TO PROBLEMS 


1. The last is 21/50. 3. See derivation of Laplace limit theorem. 5. 0.08. 
7. The frequencies of x = 0, 1,..., 10 are 0, 0, 6, 36, 54, 60, 51, 35, 7, 
1. 9. About 0.01. 11. About 0.32. 13. 0.013. 


*CHAPTER 9 
STATISTICAL INFERENCE 


9-1 The meaning of “‘statistics.’ When first introduced in the eight- 
eenth century, “statistics” meant information about governments, and as 
late as 1885 the London Statistical Society proposed to define it as “the 
science which treats of the structure of human society.” However, before 
the middle of the nineteenth century, “statistics” was used also as a new 
name for the systematic collection and study of numerical data on popu- 
lations, which had been called “political arithmetic” in the seventeenth 
century and is now named “vital statistics.” Gradually the meaning was 
broadened to include all numerical data, and today “statistics” as a plural 
refers to numerical information in any field. “Descriptive statistics” 
(singular) now means the science and art of collecting, interpreting, sum- 
marizing, and presenting numerical information. 

Where it is possible to collect all the information about the phenomena 
in which we are interested, descriptive statistics is adequate. For example, 
if we are interested in the scholastic performance of the students in a 
certain college in a certain year, we may be able to get complete informa- 
tion from the registrar. However, in many situations complete informa- 
tion is denied us. This may be due merely to limitations of time and 
expense. For example, it would be possible to question each person in the 
United States at regular intervals to find out the number of unemployed, 
but the expense would be prohibitive. In other cases, gathering complete 
information is not even theoretically possible because it would require 
the observation of an infinite number of instances, often including some 
in the future. For example, we can never observe all freely falling objects 
to see whether they follow Newton’s laws of motion, nor can we observe 
an infinite sequence of throws of a coin to see whether it follows the law of 
large numbers. In short, we often have a population of which only a sam- 
ple can be observed. We nevertheless wish information about the popula- 
tion as a whole. Since our information is incomplete, we cannot make 
statements about the population with certainty. The best we can do is to 
work out procedures for making uncertain but useful inferences about 
the population. The process of drawing conclusions about a population 
from knowledge of a sample is called statistical inference, and the science 
and art of making such inferences is called inferential stattstics. 

Any statement about a population is either true or false. However, 
different procedures of statistical inference have different probabilities of 
yielding correct statements under various conditions. It is these probabili- 
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ties that enable us to choose desirable procedures for statistical inference. 
Because of this the theory of statistical inference, also called mathematical 
statistics, becomes a special branch of probability theory. In its present 
form it is a product of the twentieth century, and especially of the last 
two decades. 

Modern statistics, both descriptive and inferential, is an enormous 
field in terms of the number of people who are penteaeiorial statisticians, 
the extent and rate of growth of the theory, and the increasing role that it 
plays in government, industry, and science. In this chapter we attempt 
nothing more than a very brief and incomplete survey of the most im- 
portant theoretical ideas of statistical inference. 


9-2 Testing statistical hypotheses. Imagine an urn containing 10 
balls, of which some may be red and the rest green. We are not permitted 
to inspect the urn and count the balls. Our only means of gaining infor- 
mation is random sampling with replacement; i.e., we may remove one 
ball, note its color, replace it, and repeat in such a way that any ball is 
as likely to be picked as any other. This means that if the proportion of 
red balls is p, the probability of picking a red ball is p and of picking a 
green ball is 1 — p. Now suppose that the claim is made that p = 1/2. 
We wish to test this hypothesis in order to be able with some justification 
to accept or reject it. Our problem is to decide on a good procedure for 
this purpose. 

This may seem to be a rather unimportant problem, but consider the 
following “real life” situations: 

A political scientist wishes to discover the proportion p of voters among 
a voting population of 10,000,000 favoring a certain candidate. He can- 
not ask all voters, so he must pick a sample and adopt a procedure for de- 
ciding about p on the basis of information provided by the sample. 

A chemist wishes to determine the proportion p of a certain compound 
in a mixture. He cannot analyze the entire mixture, so he takes a sample 
and tries to determine p from the sample. 

A biologist wishes to determine what proportion p of a certain type of 
animal have a given characteristic. He cannot observe all animals of this 
type, so he must resort to sampling. | : 

Evidently, the problem of the urn is a simplified picture of a kind of 
problem that arises in practically all cases where we wish to secure infor- 
mation by observations. 

To further simplify the urn problem let us agree first to consider only 
those procedures based on observing just four balls. Let x be the number 
of red balls observed. Then the possible outcomes of our experiment are 
x = 0, 1, 2, 3, or 4. Since this is all our information, we must decide on a 
rule that tells us whether to accept or reject the hypothesis p = 1/2 on 
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the basis of the value of z observed. Common sense suggests that x = 2 
is favorable to our hypothesis and x = Ὁ or x = 4 are unfavorable. We 
may agree, then, to accept if x = 2, and reject if x = 0 or 4. But where 
shall we draw the line? What shall we say if « = 1 or x = 3? To see 
how one could answer this question, we adopt a rule and work out the 
consequences. 

Suppose that we agree to accept if « = 2, and reject otherwise. Let 
us see what happens in all possible situations. Suppose, first, that our 
hypothesis is really true, that is, p = 1/2. Then the probability that 
x = 2 is 3/8 by (8-6-7) (n:4, x:2, p:1/2, ¢:1/2). This means that when 
our hypothesis is true, the probability of acceptance is 3/8 and of re- 
jection is 5/8. In other words, if the hypothesis were true and we made 
many experiments of this kind to test it by this rule, we should accept it 
3/8 of the time and reject it 5/8 of the time. 

By considering the other values of p and calculating the probability of 
acceptance, P[z = 2], in each case, we get the following table, in which 
probabilities are rounded to two decimal places. 


Probability 
True p of Rejection 
acceptance 
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This test procedure has the obvious disadvantage that when the hypoth- 
esis is true we reject it more often than when we accept it. It has the ad- 
vantage that when the hypothesis is false we are very likely to reject it. 
Figure 9-1 shows that the probability of rejection is smallest when the 
hypothesis is true and increases as the true p deviates more from p = 1/2. 


(a) Suppose we adopt the following test of the hypothesis p = 1/2. Take 
a sample of four with replacement, accept if x = 2 or x = 3, and reject other- 
wise. Make a table like (1) and graph the probability of rejecting the hypothe- 
sis as a function of p. Do you think this is a better test? | 
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FIGURE 9-1 


It is convenient to adopt some special terminology. First we note that 
in the above situation there is a random variable (the number of red balls 
observed) with a probability distribution about which we do not have 
complete information. We know that the distribution is binomial, but p 
is unknown. A statement about the distribution function of a random 
variable is called a statistical hypothesis. Above, our statistical hypothesis 
isp = 1/2. 

Two things should be noted here. First, this use of “hypothesis” is 
different from that in logic, although a statistical hypothesis, being a sen- 
tence, may serve as the sufficient condition in an implication. Second, in 
scientific discourse any statement that is tentative may be called a hypoth- 
esis, but not all such statements are statistical hypotheses. A statistical 
hypothesis must refer to the distribution of a random variable. Since 
most quantitative observations in science involve random elements (for 
example, random errors of observation), many scientific hypotheses can 
be formulated as statistical hypotheses even when this is not obvious on 
the surface. For example, the hypothesis that the acceleration of gravity 
is 32 {t/sec/sec may be formulated as a statement about the distribution 
of the acceleration observed in a certain experiment. On the other hand, 
the statement “Mr. Jones’ age is 28” can be verified or rejected by ob- 
taining factual evidence not subject to random variation, and “All V-8’s 
have eight cylinders” is a statement that can be established by logic with- 
out reference to observations. Neither are statistical hypotheses. 


Give further examples of hypotheses that: (Ὁ) are statistical, (6) are 
not statistical. 


A test of a statistical hypothesis is a procedure for deciding on acceptance 
or rejection of the hypothesis. In the above example our test is: accept if 
x = 2, reject otherwise. 

In our example, there are a number of alternatives to the hypothesis 
tested. Each of these is also a statistical hypothesis. The rejection of the 
hypothesis tested amounts, then, to acceptance of the disjunction of the 
alternatives. In the example, we test the hypothesis that x is distributed 
binomially with p = 1/2 against the alternatives that it is distributed 
binomially with p = 0, 0.1, 0.2, etc. The claim that z is binomially dis- 
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tributed is not questioned. We call the set of hypotheses considered the 
admissible hypotheses. In our case they are specified by 11 values of a 
single parameter p. 

Setting up a test of a statistical hypothesis about a random variable 
amounts to partitioning the sample space (set of values of the random 
variable) into two subsets such that if the observation falls in one, the 
hypothesis is accepted, and if it falls in the other, the hypothesis is re-. 
jected. The critical region of a test is that part of the sample space in which 
an observation calls for rejection. In our example, the sample space is 
{0, 1, 2, 3, 4}, and the critical region is {0, 1, 3, 4}. Evidently, the critical 
region determines the test. 

We note above that there are two ways in which we may commit an 
error by using a certain test. First, we may reject the hypothesis when it 
is really true. This is called an error of Type I. The probability of the 
Type I error in our example is 0.62. On the other hand, we may accept 
the hypothesis when it is really false. This is called an error of Type 1]. 
In our example, the probability of Type II error depends on which one of 
the alternative hypotheses is true, and it is given by the entries in the 
acceptance column in (1), excluding the entry opposite p = 0.5. If the 
hypothesis tested consisted of more than one value of p, we should have 
more than one probability of error of Type I also. 

Evidently we should like to use tests in which the probabilities of both 
types of errors were as small as possible. But a little reflection shows that 
such hopes cannot be satisfied. We can make the probability of Type I 
zero by adopting the test: accept no matter what the observation. But 
then the probability of Type II is 1! Similarly, we can make errors of 
Type II impossible at the cost of making errors of Type I certain. It 
appears that a compromise is necessary. Each compromise leads to a dif- 
ferent test, and the most desirable compromise depends on circumstances. 

One procedure is to decide on a satisfactory value for the probability 
of an error of Type I, to consider only those tests that keep the prob- 
ability below this value, and among these to choose the test that makes 
the Type II error as small as possible. The probability of Type I error is 
called the level of significance of the test. Usually it is chosen on the order 
of 0.05 or even 0.01, so that the probability of rejecting a true hypothesis 
is very small. 


For each of the possible critical regions indicated, find the probability of 
Type I error, i.e., the level of significance of the corresponding test. (d) {0, 1, 
2, 8,4}. (e) {1,2,3,4}. (ὃ {0,2,3,4}. (g) {0,1,3,4}. (Ὁ) {0, 1, 2, 4} 
(i) {0,1,2,3}. 6) (2,8,4}. (ἡ {1,8,4}. Ο {1,2,4}. (m) (1, 2,3} 
(n) (0, 8,4}. (0) (0, 2,4}. ) (0, 2,8}. ( {0,1,4}. (ἡ (0,1, 3}. 
(8) {0,1,2}. (Ὁ {0,1}. (a) {0,2}. () (0,8}. (w) {0,4}. (x) {1,2} 
(y) {1,3}. (2) {1,4}. (aa) {2,3}. (bb) {2,4}. (00) (8,4). (dd) {0} 
(ee) (1. (ff) (2. (gg) {3}. (bh) {4}. Gi) ὁ. 
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Calculations such as those above tell us all we need to know about the 
errors of Type I associated with the possible tests. To consider errors of 
Type IT, it is convenient to introduce the notion of power. The power of a 
test is the probability that it rejects the hypothesis. The power of the 
test of our example is given in the rejection column of (1). The power 
function of a test is the function that maps the admissible hypotheses 
into the corresponding probabilities of rejection of the hypothesis tested. 
The first and last columns of (1) are a table of a power function, and Fig. 
9~] is its graph. 

Let P be the power function of a test, H an admissible hypothesis, and 
Hy the hypothesis tested. Then P(H) is the probability of rejecting Ho 
when # 15 true. In particular, P(H9) is the level of significance. When 
H # Ho, P(H) is 1 minus the probability of Type II error, i.e., it is the 
probability. of correctly rejecting the hypothesis Ho. A very desirable 
power function would be one with P(H 9) as small as possible and P(H) 
otherwise as close to one as possible. 


(1) Graph the power functions of those tests in Exercises (4) through (ii) 
that have a level of significance less than or equal to 0.63. Begin by making a 
table of the probability of each value of x for each value of p, using your results 
from Exercises (g) through (0) in Section 8-6. Add appropriate entries to find 
the probabilities of an observation lying in each critical region for each value 
of p as we did in the last column of (1). 

In relation to your results in (jj): (kk) of the three tests with level of 
significance 0.63, why is (g) the best? (1) Can you answer the same question 
for those with level of significance 0.56? (mm) 0.50? (nn) 0.44? (00) 0.38? 
(pp) 0.312 (qq) Comment on the others. (rr) What would you consider 
the best test, everything considered? (85) What is your choice if it is very 
important to detect p > 1/2 but not p < 1/2? ((0) if it is important to 
detect p different from 1/2 by more than 0.2 but not important to detect smaller 
differences? 


One of the principal tasks of the statistician is the choice of tests that 
are appropriate in practical situations. The previous example illustrates 
how this is done by studying the power functions of the possible tests. 
The proper choice requires a knowledge of the situation and of the actions 
that may be based on rejecting or accepting the hypothesis. 

In the example of this section, our experiment consisted of four random 
drawings with replacement. This experiment limited our possible tests 
to those considered. It is evident that better tests could be devised if we 
took a larger sample. Indeed, common sense suggests that the larger the 
sample, the more information we may obtain. Perhaps, too, we might do 
better by sampling without replacement, if that were permitted. Indeed, 
it is obvious in the case considered that a sample of 10 without replace- 
ment would give us complete information! In more complex situations 
the variety of possible experiments would be very great. 
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It should be observed that changing the experiment may change the 
statistical hypothesis being tested. In the example of the balls in the urn, 
increasing the size of the sample introduces a different random variable 
and a different binomial distribution, even though we may still state our 
hypothesis in the form p = 1/2. It should be clear, then, that the design 
of experiments, the choice of tests, and the formulation of hypotheses are 
intimately interrelated. It is for this reason that statistical problems 
should be considered by scientists before rather than after they perform 
experiments. Often what appears to be a good experiment turns out to 
be useless because it was not designed in such a way as to permit a satis- 
factory test of a meaningful hypothesis. We cannot pursue this matter 
further here. 


PROBLEMS 


The first ten problems refer to the urn discussed in the section. The prob- 
abilities calculated in the exercises may be useful. 

1. Suppose that we must judge whether or not p = 1/2 by observing just. 
one ball. What is the sample space? What are the possible critical regions? 
What is the best test? Graph its power function. 

2. Answer the same questions if a random sample of two balls is all that is 
permitted. 

3. Design a good test of the hypothesis p = 0.6 with a sample of four mem- 
bers. Plot its power function. 

4. Design what you consider the best test for the hypothesis p = 0.8 if 
it is particularly important to detect p > 0.8. (This might be the case, for 
example, if it were particularly important to reject a medicine with too high a 
concentration of a dangerous ingredient.) Graph the power function. 

δ. Design a good test for the hypothesis 0.4 < p < 0.6. 

6. Do the same for p < 0.2. 

7. While increasing the sample size evidently improves the best tests, it is 
evident that the calculation of the power function would become more difficult. 
Illustrate by finding thé level of significance and a few other values of the 
power function for a test of p = 1/2 based on a random sample with replace- 
ment of 25 balls, where the critical region is all points except x = 11, 12, and 13. 

8. When the sample size is large, we may use the central limit theorems of 
Section 8-9. Suppose, for example, that we test p = 1/2 by a sample of 100 
observations. Experience with small samples suggests accepting the hypothesis 
if the observation lies in an interval whose center lies at x = 50, 1.e., taking as 
a critical region (50 — a 50+ a)’ or (—~ _50—a)U(50-+a_%~). If 
e is our desired level of significance, we choose a so that for p = 1/2, 1 — P 
[50 —a <2 < 50+] =e. Finding a is made easy by using the Laplace 
limit theorem (8-9-10), which tells us that (2 — 50)/5 is approximately nor- 
mally distributed. We have 50 — a <x < 50+ α «Ὁ» —a/5 < (ὦ — 50)/ 
5 < a/5. But by the Laplace theorem, P[—a/5 < (a — 50)/5 < a/5] = 
F(a/5) — F(—a/5) = 2H(a/5). For a level of significance of 0.05 we find a/5 
from Table VI so that 2H(a/5) = 0.95. Here a/5 = 2 ora = 10. What is 
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the critical region? What is the probability of error of Type I? Graph the 
power function of the above test by using the normal approximation and Table 
VI for each admissible value of p. 
9. Repeat Problem 8 for a level of significance of 0.01, and graph the power 
function. 
10. Same for level of significance of 10%. 


%11. An election is to be predicted by a sample of 10 voters. Suppose that a 
level of significance of 0.01 is desired for a test of the hypothesis that the vote is 
even. Find a critical region. What will be the probability of detecting a senti- 
ment of 54 or more percent in favor of one of the two candidates? 
* 12. Answer the same questions if we use a test with only 0.05 as level of 
significance. 

13. A coin is tossed 100 times and comes up 55 times heads. Is the coin evenly 
balanced? 

14. In the situation of Problem 9 in Section 8-9, is the die loaded? 

15. In the situation of Problem 10 in Section 8-9, are the dice loaded? 
*%16. In the situation of Problem 13 in Section 8-9, suggest a test for the 
hypothesis that the true average height is 5.4 ft. 
%* 17. Some people claim that after a run of several heads a coin is more likely 
to come up tails. Treating Table V as a record of 2500 throws of a balanced 
coin, test the hypothesis that following a run of five heads, the probability of 
heads on the next throw is 1/2. 
*18. Test the hypothesis that p = 1/2 in Table V by considering the first 
line as a random sample and letting the level of significance be 0.68. 
*19. Carry out a similar test for each line in Table ἡ. In what proportion of 
the cases is the hypothesis rejected? 


ANSWERS TO EXERCISES 


(a) See the answer to (q) and to part (q) in Exercise (jj). (d) 1. (6) 0.94. 
(f) 0.75. (g) 0.62. (8) 0.75. (i) 0.9. (j) 0.69. (k) 0.56 (1): 0.69. 
(m) 0.86. (n) 0.38. (0) 0.50. (p) 0.69. (q) 0.38. (r) 0.56. (8) 0.69. 
(t) 0.31. (u) 0.44. (v) 0.31. (w) 0.183. (x) 0.62. (y) 0.50. (z) 0.31. 
(aa) 0.62. (bb) 0.44. (cc) 0.31. (dd) 0.06. (ee) 0.25. (ff) 0.38. 
(gz) 0.25. (hh) 0.06. (ii) 0. 


This entire paragraph relates to Exercise (jj). We give 11 powers correspond- 

ing top = 0,0.1, ...,1.0 (g) See (9-2-1). 

(k) 0, 0.30, 0.44, 0.49, 0.52, 0.56, 0.63, 0.73, 0.84, 0.95, 1. 
(n) 1, 0.66, 0.44, 0.32, 0.31, 0.38, 0.50, 0.66, 0.82, 0.95, 1. 
(o) 1, 0.71, 0.56, 0.51, 0.50, 0.50, 0.50, 0.51, 0.56, 0.70, 1. 
(q) 1, 0.95, 0.82, 0.66, 0.50, 0.38, 0.31, 0.82, 0.44, 0.66, 1. 
(r) 1, 0.95, 0.84, 0.73, 0.63, 0.56, 0.52, 0.49, 0.44, 0.30, 0. 
(t) 1, 0.96, 0.82, 0.65, 0.47, 0.31, 0.18, 0.08, 0.03, 0.00, 0. 
(u) 1, 0.70, 0.56, 0.50, 0.48, 0.44, 0.37, 0.27, 0.15, 0.05, 0. 
(v) 1, 0.66, 0.44, 0.32, 0.28, 0.31, 0.37, 0.42, 0.42, 0.29, 0. 
(w) 1, 0.66, 0.41, 0.25, 0.16, 0.12, 0.16, 0.25, 0.41, 0.66, 1. 
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(x) 0, 0.34, 0.56, 0.68, 0.69, 0.62, 0.50, 0.34, 0.18, 0.05, 0. 
(y) 0, 0.30, 0.44, 0.49, 0.50, 0.50, 0.50, 0.49, 0.44, 0.30, 0. 
(z) 0, 0.29, 0.41, 0.42, 0.37, 0.31, 0.28, 0.32, 0.44, 0.66, 1. 
(aa) 0, 0.05, 0.18, 0.34, 0.50, 0.62, 0.69, 0.68, 0.56, 0.34, 0. 
(bb) 0, 0.05, 0.15, 0.27, 0.37, 0.44, 0.48, 0.50, 0.56, 0.70, 1. 
(cc) 0, 0.00, 0.03, 0.08, 0.18, 0.31, 0.48, 0.65, 0.82, 0.95, 1. 
(dd) 1, 0.66, 0.41, 0.24, 0.13, 0.06, 0.03, 0.01, 0.00, 0.00, 0. 
(ee) 0, 0.29, 0.41, 0.41, 0.35, 0.25, 0.15, 0.08, 0.03, 0.00, 0. 
(ff) 0, 0.05, 0.15, 0.26, 0.35, 0.38, 0.35, 0.26, 0.15, 0.05, 0. 
(gz) 0, 0.00, 0.03, 0.08, 0.15, 0.25, 0.35, 0.41, 0.41, 0.29, 0. 
(hh) 0, 0.00, 0.00, 0.01, 0.03, 0.06, 0.13, 0.24, 0.41, 0.66, 1. 
(ii) 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0. 


(kk) It has greatest power, except for one case in relation to each of (u) and 
(x), and there its power is almost as high. (1) No, one is better for p > 1/2, 
the other for p < 1/2. (mm) Test (0) is better even though 2 belongs to criti- 
cal region. (nn) Like (ll). (oo) Either (n) or (q) is better than (ff), but 
they are like (11). (pp) (t) is best for p < 1/2; (ec) for p > 1/2; other two 
might be good compromises. (qq) All have low powers, except (w), which 
appears to be a good test except that it does not have high powers for p near 1/2. 
(rr) No unique answer. Two candidates are (w), (g). A really satisfying test 
doesn’t seem possible with such a small sample. (ss) (cc) or (z). (tt) (w). 


ANSWERS TO PROBLEMS 


1. {0, 1}; {0, 1}, {O}, {1}, ὦ. Either {0} or {1} is better than others. See 
Fig. 9-2 for graph of power function of {0}, the best for testing p = 1/2 against 
p < 1/2. 3. Suggestion: {0, 1, 2,4} appears to be plausible. See Fig. 9-3 for 


the power function. To be sure of our choice we should investigate other tests. 


0 0.6 1.0 
FIGuRE 9-3 
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5. {0,4}. 7. Level of significance is 1 — [C?? + C?3 + C#3)(1/2)7>. 9. See 
Fig. 9-4. Critical region is (87 63)’. 18. Yes at 0.05 level of significance (see 
Problem 8). Yes at any level of significance <0.32. 15. If we test the hypothe- 
sis that the dice are fair, as above, the answer is yes at 0.05 significance level. 


9-3 Estimation. In the previous section we approached the problem of 
incomplete information by formulating and testing specific hypotheses. 
An alternative approach is to design procedures for estimating the value 
of an unknown. Instead of asking whether the hypothesis p = 1/2 is true 
about the balls in the urn, we might adopt a formula to give an estimate 
of p in terms of our observations. 

A method of estimation that seems very natural in this example is to 
observe the number z of red balls in a sample of n balls obtained by random 
sampling with replacement, then adopt x/n as an estimate of p. Why is 
this reasonable? For one thing, the expected value of x/n is p, since 


(1) Elx/n] = 2X, (&/n) f(x) (8-6-6) 


(2) = (1/n) Vaf(x) (6-77-18) 
(3) = (1/n)E{[z] (8-6-6) 
(4) — (1/n)np = » _ (Problem 4 in Section 8-6). 


Of course, in any particular sample x/n may differ considerably from 
the true value of p, but this shows that at least p is its average value over 
many samples. This suggests, of course, taking many samples and aver- 
aging the observed values of x/n. Such a procedure is equivalent to simply 
taking a larger value of n in a single sample. 

To generalize, suppose a random variable v has a distribution function 
involving an unknown parameter @. In the urn problem this parameter 
isp. Let 21, 15, ..., Xn be n observed values of v obtained by a random 
sample with replacement. (This is not the only way of observing values 
of the random variable, but we consider this possibility for the sake of 
simplicity.) Let @ be a function of 2), %2, ..., ἅμ. Then we call 6 an’ 
estimator of 6, and 6(x,, 15, ... %) an estimate of 6. In the example of the 
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urn, Ὁ is ἃ random variable that takes the values 1 and Ὁ with probability 
p and 1 — p, 1 corresponding to an observation of red and 0 to green. 
If 2 is the number of red balls observed in a drawing of n balls with re- 
placement, then p(x, ...%) = «/n = (4, + %o +... %y)/n is the esti- 
mate mentioned above. 

When the expected value of an estimate is equal to the parameter being 
estimated we called the estimate unbiased. We showed above that x«/n 
is an unbiased estimate of the proportion of red balls in the urn. More 
generally, suppose that a random variable has frequency function f with 
unknown mean uw. Let = (x1 + te +... + χη) η be the mean of n 
independent observations of v. Then it can be proved (though not without 
a little more theory about expectations than we have covered) that 
E[z| = μ, 1.e., the mean of the sample is an unbiased estimate of the mean 
of the population. 

Estimates of this type are called point estimates, for the following reason. 
We may consider the possible values of the unknown parameter as a set of 
points on a real axis. An estimator gives us a single value of the parameter, 
1.6., a single point. 

Obviously, there are many different formulas for estimating a param- 
eter. For example, to estimate the mean of a distribution, we might take 
the average of the largest and smallest values of the variable observed, or 
the value most frequently observed (the mode of the sample), or the value 
with an equal number of larger and smaller observed values (the median 
of the sample). 

(a) Suppose one estimator is unbiased but frequently has values very much 
larger or very much smaller than the true value (these values balancing to give 
a mean equal to the true value). Suppose another were biased but had an ex- 
pected value very near the true value and had no possible values differing widely 
from the true value. Which would pe preferable? 

From the definition of an estimate, it is evident that an estimate is a 
random variable with a distribution related to the distribution under 
consideration. Exercise (a) suggests that it is desirable for the variance of 
the distribution of an estimate to be small. Indeed, it may be better to 
use a biased estimate with small variance and expectation near the true 
value of the parameter than to use an unbiased estimate with large vari- 
ance. Evidently we must choose between estimators on the basis of their 
properties and our needs just as was the case for tests. Lack of bias and 
small variance have been mentioned as desirable properties of point esti- 
mates. There are many others, some of which will be mentioned in the 
problems. It is usually impossible to find the best estimate from every 
point of view, and a compromise 15 required. 

When we make a point estimate, we do not imagine that it gives the 
true value of the parameter exactly, except by chance. For example, in 
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the problem of the ten balls in an urn, if the true value of p is 1/2, the 
probability that z/n = 1/2 for a sample with n = 4 is 0.3750. On the 
other hand the probability that 1/4 < x/n < 3/4 is 0.8750. If we were 
to take a sample of size 10, P[z/n = 1/2] = 0.25, which is still smaller. 
But P[0.4 « 2/n < 0.6] = 0.65, and P[0.3 < 2/n < 0.7] = 0.89. In 
short, the estimate is likely to be near the true value; and as we increase 
the size of the interval around the true value, the probability that our 
estimate is within the interval increases. 


(b) Check the above calculations. (c) Find the same probabilities for a 
sample of size 100. 


These considerations suggest that instead of determining a single value 
of the unknown parameter, we find an interval within which we may con- 
fidently assert that the true value les. Of course, the longer we make the 
interval, the greater is our confidence and also the less the precision of our 
statement. Hence, as always in statistics, a compromise 15 necessary. 

To make this more precise, let us suppose that we are estimating p in 
the problem of the ten balls in an urn by means of a sample of 100 observa- 
tions. Now we know that (x — np)/\/npq is approximately normally 
distributed. Hence 


(5) Pla < ( — np)/V npg < Ὁ] = F(b) — F(a), 


where F is the standard normal distribution function. This says that for 
a given p, g = 1 — p, and n, the probability that the observed z lies 
between the given limits, is as indicated. It seems natural to consider an 


interval with center at np, that is, to let a = —b = z. Then 
(6) Pi—z < ὦ — np)/V npg < 2] = 2H(2), 


where H(z) is found from Table VI. 
The inequalities in (6) may be written 


(7) —zVnpg < « — np < zV npg, 
(8) np — zVnpq < x < np + τν npg. 


Clearly, (6) gives the probability that the observed z lies within a certain 
number z of standard deviations ./npq from the mean np of the dis- 
tribution. 

Since we should like our interval to give a high degree of confidence, we 
begin by choosing z in (6) to accomplish this. For z = 2 we have 


(9) P[—2 < (x — np)/V npg < 2] = 0.95. 
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For a given n and an observed x there will be some values of p that satisfy 
the inequality and some that do not. It is evident that when p — 0 or 
p—1, and x τέ np, (x — np)/V/npqg— 40; and when p = 2/n, 
(x — np)//npq = 0. This suggests that there are values of p, say p; and 
>, between which the inequality is satisfied and that these values are just 
those that are given by 


(10) —2= (@—np)/Vnpq and (ὦ — np)/Wnpq = 2. 


Substituting g = 1 — p and squaring to eliminate radicals, we get from 
either equation 


(11) (x — np)” = 4np(l — p), 
which simplifies to 
(12) (n? + 4n)p? — (Qnx + 4n)p + x? = 0. 


This can be solved for p to give us the two desired values ρὲ and po. In 
our particular case, ἢ = 100. If x = 50, the roots of the equation are 
approximately 0.4 and 0.6. 


(4) Check that the solutions do satisfy (12) approximately. (6) Carry 
through the algebra to find them. (ἢ) Repeat the above reasoning but with z 
chosen to make the probability in (9) approximately 0.68. (g) Show that 


(18) [—z < (x — np)/V npg “ 2] [p1 S pS pel, 
where p; and ρ are the roots of 

(14) n(n + 22)p? — n(2x + 22)p + 2? = 0. 

(h) Hence show that p; and po so defined are such that 

(15) Pipi S pS pe] = 2H (2). 


We have shown that the interval (p1; pe), whose endpoints are roots 
of (14), has a probability 2H(z) of containing the true value of p. It is 
important to keep in mind that p is not a random variable, but is fixed 
(though unknown). The probability in (15) is the probability that p; 
and pz defined by (14) do enclose the mean. These endpoints are random 
variables, since they depend on x. By choosing z to get the desired prob- 
ability in (15) we have a method of determining an interval within which 
the true value may be asserted to lie with a known probability that the 
statement will be true. 
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An interval within which an unknown parameter is asserted to lie is 
called a confidence interval. Its endpoints are called confidence limits, 
and the probability that the interval contains the true value is called 
the confidence coefficient. In the above example the confidence interval is 
(0.4 0.6), the confidence limits are 0.4 and 0.6, and the confidence 
coefficient is 95%. 

The confidence coefficient gives the probability that the interval con- 
tains the true value in the following sense. If we make many such experl- 
ments, from each finding p; and ps. in the same way, then the relative 
frequency of the occasions in which these limits inelude the true p between 
them will be nearly equal to the confidence coefficient. The parameter p 
is, of course, not a random variable. The random variable is x, and its 
different values determine different intervals. The meaning of (9) is that 
about 95% of the time the determined interval does include the true value 
of p. Hence, if we use this method of interval estimation we shall be 
making true statements 95% of the time. If the confidence coefficient is 
a%, we call the endpoints of the confidence interval a% confidence 
limits. 

With n = 100, find 95% confidence limits for pif: (4) « = 60, (1) x = 80, 
(Κ) = 20. 

(1) Repeat Exercise (i) for 68% confidence limits. (m) Repeat Exercise (i) 
for 99% confidence limits. 

When parameters of other distributions are being estimated, the details 
of finding the confidence limits are different, but the essential idea is the 
same. We always adopt a procedure that results in true statements a 
certain desired percentage of the time. This type of estimation is known 
as interval estimation as opposed to point estimation discussed earlier in 
this section. 


_ PROBLEMS 


1. To estimate the total weight of 2000 men who are to board a ship, a sample 
of 100 men are weighed and their average weight found to be 165 lb. Make an 
unbiased estimate of the weight of the 2000 men. 

2. Suppose that p = 0.3 in the urn problem. What is the probability that 
in a random sample with replacement of 10 balls that σα = 3? that2 < x < 4? 
that 1 <2 < 5? 

3. Suppose that x = 10 in a sample of size 25 in the urn problem. Use the 
method of this section to find 95% confidence limits for p. 

4. Find 68% confidence limits in the same situation. 

5. Could the same procedure be used to find confidence limits for a sample of 
size four? 

6. Suppose that a sample of size four is drawn in the urn problem and that 
two red balls are observed. Find 95% confidence limits for p. (This requires 
finding z by trial so that the probability in the left of (6) is 0.95.) 

7. In the situation of Problem 13 in Section 9-2, find 95% confidence limits 
for the probability that the coin comes up heads. 
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8. In trying to predict a presidential election, a polling group used a sample 
of 10,000. They found that 49% of the sample favored candidate A. With 
what confidence can they predict that A will lose? 

9. Taking account of (8-9-8), find a 95% confidence interval for estimating 
the mean weight in Problem 1, assuming that the standard deviation of the dis- 
tribution of the weights is 25 lb. 

10. Do the same for 68% confidence limits. 

11. Rework Problems 9 and 10 for a sample of size 25. 

12. Could we apply the method for a sample of size 10? 
*%13. Let 21, 22, ..., 2n be observed values of a random variable v with fre- 
quency f(z, 9), where θ is to be estimated. Assuming independence of the obser- 
vations, the probability of observing these n values is f(x1, 0)f(x2, 0)... f(%n, 8). 
The value of this expression is called the likelihood. A value of @ Gf any) that 
makes the likelihood maximum is called a maximum likelihood estimate of @. 
Show that the mean of a sample is a maximum likelihood estimate of the mean 
of a normal population with unknown mean μ and variance o”. 
% 14. Similarly show that the mean of a sample from a population following the 
Poisson ee is a maximum likelihood estimate of the parameter X. 
*15. Show that σξ = E[x?] — (E£[z])?. 
#16. Show that E[z?] > (E[z])?. 
% 17. An insurance company wishes to be 99% sure that the mean age of the 
policy holders does not differ from 60 years by more than one year. How large 
a group of policy holders is required, assuming that the mean and standard 
deviation of all insurable people available as policy holders are 60 and five? 
*% 18. Suppose that 10 fish in a lake are marked and put back. Then 10 more 
fish are caught and two fish are found to be marked. How many fish are there 
in the lake? (See An Introduction to Probability Theory and Its Applications, 
by William Feller, pp. 37 ff) 
% 19. Show that the midpoint of the interval for estimating p from (14) is not 
at z/n unless z/n = 1/2. Under what conditions is this midpoint less than 
z/n? Show that this midpoint approaches x/n as n approaches © or as z ap- 
proaches zero. 


ANSWERS TO EXERCISES 


(a) Perhaps the second, unless a very large number of estimates are to be 
made and averaged. (0) Plz/n = 1/2] = P[49.5 < x < 50.5] = 2H(1/10) = 
0.08; P[0.4 < 2/n < 0.6] = 0.96; and P[(0.3 < z/n< 0.7] = 1. (ὃ 0.45 
and 0. 55. (g) The inequalities are equivalent to (x — np)/W npq < 2. Squaring 
both members yields (14). See (3-9-16), (8-5-41). (8) See (6). (i) 0.5 and 
0.69. (j) 0.71 and 0.87.0 (k) 0.18 and 0.29. (1) Same but with (12) re- 
placed by (n? + n)p? — (2nz + n)p + 2? = 0;0.55 and 0.65. (m) z = 2.6. 


ANSWERS TO PROBLEMS 


1. 165. 3. (0.23 0.59) Note that z/n = 10/25 = 0.4. 5. No, since we are 
using the Laplace limit theorem (8-9-10) and 4 is hardly large enough to qualify. 
7. (45 64). 9. Taking —a = ὃ = zin (8-9-8), F(b) — F(a) = 2H(z) = 0.95 
when z = 2. Hence the interval is given by —2 < (165 — p)/(25/10) < 2 
160 <n < 170. 11. 155 < μ < 175. 
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9-4 Statistical decision functions. A decade ago modern statistical 
theory consisted essentially of the exploitation of the ideas and methods 
discussed in the last two sections. Most practical statistical work is still 
formulated in these terms, but the theory has been considerably generalized 
in its conceptions and broadened in its applications. The fundamental 
idea in this development is the concept of a statistical decision function. 
The purpose of this section is to introduce this idea, without, however, 
attempting to go very deeply into either the theory or applications, which 
are still in a state of flux and involve mathematical techniques more 
advanced than can be treated in this book. 

The situations considered in the last two sections have several features 
in common. First, we are required to make a decision, e.g., to accept or 
reject an hypothesis, to decide on a method of testing a hypothesis, or 
to estimate a parameter. Second, we cannot obtain sufficient information 
to make a decision with certainty that it is correct, e.g., we could not 
observe the number of red balls in the urn. The basic problem is to de- 
cide on a satisfactory method of obtaining partial information and of 
basing a decision upon 10. 

These features are common to many other situations that appear in 
science, business, and government. Indeed, most of our decisions are 
made without complete information or certainty of the consequences. 
Nevertheless, we must make decisions and we need satisfactory procedures 
for doing so. A procedure for decision must tell us what experiments to 
perform in order to get information and what action to take on the basis 
of the information obtained. To specify such a procedure is equivalent 
to specifying a function whose domain is the set of possible outcomes of 
some experiment and whose range is the set of possible actions. The 
image of each experimental outcome is the action we are to take, so that 
the function gives a complete guide for decision. We call such a function 
a decision function. If the action taken depends on observed values of a 
random variable, so that the independent variable and hence the de- 
pendent variable in the decision function are random variables, the func- 
tion is called a statistical decision function. A test of a hypothesis is a 
statistical decision function since it gives a correspondence between ob- 
served values of a random variable and two possible actions, acceptance 
or rejection. A method of point estimation is a statistical decision function 
that maps the experimental outcomes into the set of all possible values of 
the parameter to be estimated. Here the set of possible actions may be 
infinite, each action consisting of asserting that the parameter has one 
of its possible values. Similar remarks apply to methods of interval 
estimation. 

(a) Verify this by indicating the domain and range of the statistical decision 
function corresponding to the method of interval estimation. 
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The basic problem in statistics is to choose statistical decision functions 
appropriate to specific situations. To do this we wish to have criteria for 
choosing such functions. To decide on satisfactory decision functions we 
must consider all the consequences of adopting them. This means con- 
sidering the losses that may result from wrong decisions as well as the 
cost of gathering information. In the previous sections the cost of obtain- 
ing information did not enter since we specified an experiment and left 
open only the question of how the resulting information was to be utilized. 

To indicate in a very simple context the factors that are involved in 
choosing a decision function, we imagine a manufacturer of a product 
(say light bulbs) who sells the product in lots of 100. His customers ex- 
. pect an occasional faulty bulb, but there will be complaints and loss of 
business if there are too many. For simplicity we assume that there are 
just two possibilities, 1 faulty bulb in a lot or 10 faulty bulbs. In the 
former case the customer is satisfied, in the latter case we assume that his 
dissatisfaction costs the manufacturer $500. We assume that it costs $1 
to test a bulb, and $100 to reject a lot and refrain from selling it. Suppose 
also that from quality control records it is known that the probability 
of 1 faulty bulb is 9/10, and that of 10 faulty bulbs is 1/10. What procedure 
should be followed? 

Letting x be the number of faulty bulbs in a lot, we may summarize 
the situation as follows. 

_ Possible states: 


G: the lot is good, 1.6., x = 1; P[G] = 9/10. 
B: the lot is bad, i.e., x = 10; P[B] = 1/10. 
Possible terminal actions: 


A: accept the lot and sell it. 
R: reject the lot and do not sell it. 


Fither situation may be combined with either terminal action. The fol- 
lowing table summarizes the costs associated with each combination. 


G B 
A 0 | 500 
(1) -------- 
& | 100 | 100 


One procedure is simply to reject all lots. Then the cost per lot is $100. 
There is no danger of losing $500, but there is also no possibility of doing 
any business! Another possibility is to accept every lot. Now the cost 
C under any procedure is a random variable. In this case it has two values, 
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0 and 500, with probabilities of 9/10 and 1/10 respectively. Hence, if we 
accept all lots, the expected cost per lot is given by E[C] = 0 - (9/10) -Ὁ 
500 - (1/10) = 50. Apparently, in this situation, it costs less to accept 
all lots than to reject all lots. 


Make a table such as (1) and find expected cost of accepting all lots and re- 
jecting all lots for: (0) Cost of accepting a bad lot is $1000 instead of $500. 
(c) Cost of accepting a bad lot is $2000. (ἃ) Probabilities are 8/10 and 2/10 
instead of 9/10 and 1/10. (6) Probabilities are 99/100 and 1/100 instead of 
9/10 and 1/10. 


Evidently if we are required to accept or reject all lots, it may pay to 
accept all of them or reject all of them (and go out of business on this 
item) depending on the underlying costs and probabilities. However, it 
seems obvious that we might reduce costs by testing some bulbs from each 
lot. Suppose we adopt the dec.sion function D, consisting of testing one 
bulb and rejecting or accepting the lot according as this one bulb is bad 
or good. The expected cost C is $1 plus the sum of the entries in (1) each 
multiplied by the probability that it is the outcome. For example, letting 
y be the number of faulty bulbs observed, the probability of rejection 
when the lot is good is P[G|P[y = 1{6] = (0.9)(0.01). Hence for D,, 
E[C] = 1 + 0(0.9)(0.99) + 500(0.1)(0.9) -Ε 100(0.9)(0.01) + 100(0.1) 
(0.1) = 47.90. 


(f) Let De consist of testing two bulbs and rejecting the lot if either is faulty. 
Show that the expected cost is $46.20. (g) Find the expected cost if three 
bulbs are tested and the lot rejected if one or more is faulty. (8) Find the ex- 
pected cost for Dio consisting of testing 10 bulbs and rejecting if one or more 
is faulty. 


We have considered here only a few possible decision functions. [Ὁ 
seems reasonable to prefer one decision function to another if it yields a 
lower expected cost. (Why?) Considering cost as negative income (and 
income as negative cost), we may rephrase this by saying that a decision 
function is preferable if it yields higher expected income. This suggests 
the following maximization prinezple: 


Choose a decision function so as to maximize expected income 
(1.6., minimize expected cost). 


(2) 


(i) Argue that among the decision functions D, consisting in testing n bulbs 
and rejecting if one or more is bad, there is a best one according to criterion (2). 
%(j) Find it. 


PROBLEMS 


ΧΙ. Rework Exercises (f) through (h) with P[G] = 8/10, P[B] = 2/10. 
*2. Do the same for P[G] = 0.99, P[B] = 0.01. 
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*3. Suppose we know that the urn with ten balls has been filled by someone 
who chose at random a number between 0 and 10 inclusive, placed that number 
of red balls in the urn and completed with green balls. We are asked to estimate 
the number of red balls by sampling four balls with replacement. If our estimate 
is correct, we get $500. For each 1/10 we miss, we get $50 less, so that we gain 
nothing if we estimate Ὁ and there are penny 10. How should we estimate: What 
is our expected pay? 

* 4. Reconsider Problem 3 for a sample of 100 with replacement 

*5. Verify the statement in the footnote on page 58 of “The Theory of Statis- 
tical Decision,” by L. J. Savage, in the Journal of the American Statistical 
Association, March, 1951. | 

#6. Read the article referred to in Problem 5, and formulate the minimax 
principle in your own words. 

*7. Read “Statistical Decisions,” by E. 5. Keeping, in the American Mathe- 
matical Monthly, March, 1956. Check the author’s calculations. 

*8. In the ancient method of trial by ordeal, the accused was subject to an 
ordeal, such as walking over hot coals, and acquitted if he survived. Suppose 
that the probability of survival for an innocent man was 80% and for a guilty 
man 50% (because of superstitious fear of the ordeal). Discuss the effectiveness 
of this method in terms of the probabilities of the accused being guilty, a prob- 
ability that depends on the care exercised by officials and the community. 


ANSWERS TO EXERCISES 


(a) Domain is set of possible observed values of some random variable. 
Range is set of corresponding intervals. (Ὁ) 100, 100. (ὁ) 200, 100. (d) 100, 
100. (e) 5,100. (g) 44.9. (h) 42.5. (() There are only a finite number 
of such decision functions, since n < 100. 


FURTHER READING 


The modern theory of statistical inference is closely related to the theory of 
games. The following references are suitable for the student who wishes to follow 
up the meager introduction given in this chapter. 

Statistics, A New Approach, by W. Allen Wallis and Harry V. Roberts, The 
Free Press, Glencoe, Illinois, 1956. An elementary book with many interesting 
examples. | 

First Course in Probability and Statistics, by Jerzy Neyman, Henry Holt and 
Co., New York, 1950. A careful discussion of basic ideas by one of the architects 
of modern statistics. 

The Compleat Strategyst, by John D. Williams, McGraw-Hill, New York, 1954. 
Very easy introduction to game theory. 

Games and Decisions, by R. D. Luce and H. Raiffa, Wiley, New York, 1957. 

Theory of Games and Statistical Decisions, by Ὁ. H. Blackwell and Μ, A. 
Girshick, Wiley, New York, 1954. 

Introduction to the Theory of Games, by J. McKinsey, McGraw-Hill, New 
York, 1952. 


* CHAPTER 10 
ABSTRACT MATHEMATICAL THEORIES 


10-1 The nature of abstract theories. The dictionary descriptions of 
“abstract” that apply here are “considered apart from any application 
to a particular object,” “without reference to a thing or things,” and 
“general, as opposed to particular.” All mathematical theories are ab- 
stract to some extent. For example, the statement 2 + 3 = 5 says in 
effect that two things and three things are five things, without reference 
to any particular things. Of course, it is also a statement about the par- 
ticular numbers 2, 3, and 5. The law a + ὃ = 6+ a is more abstract, 
since it does not refer to any particular numbers. Still, it does refer to 
numbers rather than some other objects. In the form VaVba € Re A DE 
Re—->a+b=b6-+ 4, it is clearly a statement about a particular thing, 
namely the real numbers Re. We see from these examples that there are 
various levels of abstraction and that a theory may be abstract from one 
point of view and concrete from another. 


(a) To what extent 152 - 3 = 5 “apart from any application to a particular 
object,” and to what extent does it apply to a particular object? (Ὁ) Discuss 
the physical law, “A freely falling body moves with constant acceleration,” 
from the point of view of the preceding paragraph. (0) Is it correct to say that 
all scientific theories are abstract to some extent? (d) Does abstractness 
imply lack of applicability? 


The term “abstract mathematical theories” designates mathematical 
theories on a considerably higher level of abstraction than the examples 
mentioned. To make clear how the term is used, we give a series of pro- 
gressively more abstract statements. “Two ducks and three ducks are 
the same as three ducks and two ducks” is a statement of fact about ducks. 
It is already abstract to some extent, since no particular ducks are men- 
tioned. “Two plus three is the same as three plus two” is more abstract, 
since we do not specify what is being counted. On the other hand, we are 
referring specifically to the numbers 2 and 3. The lawa+b=b+ain 
the context of the real numbers, is much more abstract, as we suggested 
above. Now suppose we simply write the law Va Vb a + ὃ = ὃ + a, with- 
out indicating whether we are talking about integers,. rational numbers, 
real numbers or complex numbers. We may investigate the consequences 
of this law without concerning ourselves with what numbers are involved. 
If we do so, we are considering the commutative law in abstraction from 
the particular numbers to which it applies. However, we are still think- 
ing of numbers and the operation of addition. But we have seen that the 
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commutative property applies to things other than numbers. Thus we 
have such lawsasp Vqg~e=qVppAq=qAp,AUB=BUA, 
and An B= Bon A. We may abstract from the particular operations 
by considering an unspecified binary operation (indicated by *) on an 
unspecified set of objects S such that the law a * 6 = ὃ * a holds for all 
members of S. 

In the last example we have abstracted to the point where the objects 
under discussion are not specified (the name S 15 adopted merely as a 
convenience and tells us nothing about its members) and the operation 
is unspecified except to indicate that it serves to yield an object when 
it is applied to an ordered pair of members of S. After all this abstraction, 
what do we have left? Merely the statement that the binary operation * 
(unspecified) is commutative in S (unspecified). We can then investigate 
the consequences of this law. Of course, a theory based on only this one 
law would not be of much interest. The typical abstract mathematical 
theory involves a number of laws about unspecified objects, relations, and 
operations. 

We have presented most of the theory of this book in axiomatic form. 
That is, we have begun each theory with undefined terms and axioms, 
introduced definitions, and derived theorems from the axioms on the basis 
of rules of proof. This is not the only way in which mathematical theories 
can be developed. But it has many advantages, and it 1s particularly use- 
ful in abstract theories, in which we cannot rely on any intuitive knowledge 
of the objects under consideration. Of course, in developing any axiomatic 
theory we can base our proofs only on the stated axioms and definitions, 
even though we may “know” a good deal about the objects of the theory. 
But in an abstract theory, the objects are unspecified and we know nothing 
about them except that they satisfy whatever axioms we are using as the 
basis for our theoretical construction. 


(e) Would it be correct to say that we make no use of our intuitive knowledge 
in constructing mathematical theories? 


Of what use is a theory about unspecified relations and operations in 
the context of unspecified objects? Very useful indeed just because of the 
lack of specification! An abstract mathematical theory is applicable to 
any particular situation in which the objects and operations “fit” the 
theory, that is, satisfy the axioms of the theory. If the objects in any 
particular situation satisfy the axioms, then they must satisfy all the laws 
of the theory. Hence, an appropriately chosen abstract theory is “tailor 
made” to give us answers in particular situations. It frequently happens 
that a single abstract theory is applicable in many entirely different situa- 
tions. (A trivial example is the commutative law mentioned above.) 
Hence, by developing an abstract theory we may be able at one stroke to 
solve problems in many different areas. This is precisely the value of any 
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abstract theory, from the simplest generalizations in science to the most 
abstract mathematics. The more abstract the theory, the wider its poten- 
tial applicability. 

(f) Which is of wider applicability, 2+ 3 = 3+ 2 or a+6=6-+ a? 
(g) Which is more abstract and which is more widely applicable, the theory of 
freely falling bodies or the theory of bodies moving with constant acceleration? 
(h) Bertrand Russell is reported to have said in an after-dinner speech that 
mathematics is the science in which we never know what we are talking about 
or whether what we say is true. Explain. 


The purposes of this chapter are (1) to introduce the reader to several 
of the most interesting theories of abstract mathematics, (2) to give him 
some insight into the way in which abstract theories are constructed, and 
(3) to review some of the ideas of previous chapters in an abstract setting. 


ANSWERS TO EXERCISES 
(c) Yes. (d) Onthe contrary! (e) No. (ἢ) Thelatter. (9) The latter. 


10-2 Fields. The notion of a field may be obtained by abstraction from 
the properties of real numbers embodied in the axioms (1--16--1} through 
(1-16-10). Suppose we have a set S that is closed under two binary 
operations (1-16-1) which are associative (1-16-4 and 1—16—5) and com- 
mutative (1-16-2 and 1—-16-3); that contains an identity element for each 
operation (1-16-7 and 1-16-8); that contains an inverse of each of its 
elements with respect to the first operation (1-16-9); that contains an 
inverse of each of its elements except the first identity element with respect 
to the second operation (1-16-10); and such that the second operation 
is distributive over the first (1-16-6). We call the operations plus (+) 
and times (-) and call the corresponding identity elements zero (0) and 
one (1), though there is no requirement that they be the familiar opera- 
tions or numbers formerly so designated. A system {S, τ -, 0, 1} satis- 
fying these axioms is called a field. 


(a) Name three sets of numbers that are fields. (Ὁ) If we know that a 
system is a field, can we apply to it any of the algebraic identities proved for 
real numbers? Why? 


Even if the examples in Exercise (a) were the only possible fields, the 
concept would be of some value because of Exercise (b). However, there 
are many other (indeed an infinite number) of different fields. Consider, 
for example, the set S consisting of the numbers 0, 1, and 2. We define 
addition and multiplication as follows: Add or multiply as usual and then 
subtract multiples of 3 until the result is one of the members of S. Take 
0 and 1 as the identity elements. With these definitions {S,-+, -, 0, 1} is 
a field. To justify this statement we must verify that all the axioms hold. 
The following addition and multiplication tables help “see” the situation. 
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+0 1 2 0 1 2 
00 1 2 0 0 O 

1 2 
a) 1 1 2 0 ϑ 0 1 2 
0 2 | 


220 1 2 


(c) Verify the tables by doing the arithmetic. 


From the tables or the definition of addition and multiplication it 18 
obvious that S is closed. Associativity follows from the definition of the 
operations, since we begin by applying the usual operations (which we 
know are associative) and then subtract multiples of three (which is in- 
dependent of the way in which we have associated the numbers in the 
first calculation). Commutativity and the distributive law follow in the 
same way. We can also verify the commutative law by noting that the 
tables are symmetric about the main diagonal from upper left to lower 
right. From the tables also we see immediately that 0 and 1 have the 
desired properties. The existence of an inverse is shown in the tables by 
the fact that 0 appears in each row and column of (1) and 1 appears in 
each row and column of (2) except the row and column corresponding to 0. 

(d) Show that {0,1} together with addition and multiplication defined as 
above but subtracting multiples of 2 instead of multiples of 3, is a field. (6) Show 
that {0, 1, 2,3}, with the operations defined as above but subtracting multiples 
of 4 is not a field. (f) Show without reference to the tables (1) and (2) that 0 
and 1 have the desired properties and that the required inverses exist. 

We have given two examples of finite fields, that is, fields with a finite 
number of elements. There are an infinite number of finite fields con- 
structible similarly. Indeed, consider the set {0, 1, 2,...m — 1}, where 
nis a prime number. Define the sum and product as obtained by the usual 
operations followed by the subtraction of multiples of n until one of the 
elements is obtained. Then an argument similar to those above shows 
that the system is a field. 


(g) Why must 7 be prime? 

Are there infinite fields different from the rationals, reals, and complex 
numbers? Yes, an infinite number. Consider, for example, the set of all 
real numbers of the form a - ὃ \/2, where a and ὃ are rationals, together 
with the usual operations of addition and multiplication and the two iden- 
tity elements 0 + 0 ΝῺ and 1 + θν 2. It is obvious that all axioms are 
satisfied, except possibly for the existence of the inverses. However, the 
identities 
(3) (a + bV2) + (—a — ὦ -- 0+ 0V2 


(4) (a + bV2) [ v2| “τον 


|, ἐπὶ pean eeees 
a2 — 2b2 az — Peete 


exhibit the existence. 


552 ABSTRACT MATHEMATICAL THEORIES [cHap. 10 


(h) How does (4) show lack of a multiplicative inverse for 0-+ 0/2? 
(i) Show that the above system is closed under multiplication. (j) What 
about the possibility of a? — 2b? = 0 in (4)? (k) Show that the numbers 
of the form a-+ bvc, for fixed ὁ an integer not a perfect square and a and ὃ 
rational, form a field. (1) Show that if we modify Exercise (k) to require that 
a and ὃ be integers, the result is not a field. (m) Show that we do not get a 
field if we allow a and ὦ to be any reals in Exercise (k). 


We defined subtraction and division in (1-14-19) and (1—14-1) so that 
(as proved in Section 1-16) 


(5) a—b=a-+ (—d), 
(6) az+b=a-b1, 


Evidently this can be done in any field because of the existence of the 
inverses —b and δ᾽ ἢ. 


(n) Do subtraction and division in any field have all the properties that we 
derived for real numbers? (0) In the system of Exercise (6) find 2/3, 3/2, 
(2/3)/(1/2), and (1/3)(1/2). (Ὁ) Why is division by zero undefined in a field? 


PROBLEMS 


1. Show that the complex numbers with rational components form a field 
under the usual operations and with the usual identity elements. 

2. How do the tables (1) and (2) illustrate the fact that in a field there is 
only one element with the properties of 0 and only one with the properties of 1? 
(We describe this by saying that the identities are unique.) 

3. Can a field have no elements? Can a field have just one element? 

4. Show that the real number of the form αν  - ὃν 8 with a and 6 rational 
do not form a field. 

5. Do all numbers of the form a/2”, where a is an integer, form a field? 

6. Answer the same question if a may be any rational. 

7. In the definition of a field, we used equality (=) as the one relation of 
our system. A slightly more abstract theory results from using an unspecified 
relation R. We require merely that this unspecified relation be reflexive, sym- 
metric, and transitive, 1.e., | 


(7) xRex (Reflexivity), 
(8) zsRy—oyRe (Symmetry), 
(9) ( ἂν A y R2) > (ὦ R2) (Transitivity). 


Such a relation is called an equivalence relation. Then we may define a field as 
a set {S,+,-,0, 1,=}, where = is some equivalence relation. Let a relation 
called “congruence modulus 3” be defined as follows: 


(10) Def. [a = ὁ (mod 3)] = [a — 6 is divisible by 3]. 
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We read a = ὃ (mod 8) as “a is congruent to ὁ mod 3.” Show that congruence 
mod 3 is an equivalence relation. Show that the integers form a field with this 
equivalence relation. 

8. Is = an equivalence relation? Name as many equivalence relations as 
you can. What property does = have that is not possessed by other equivalence 
relations? 

9. Define congruence modulus m. Show that the integers with congruence 
mod m form a field if and only if m is a prime. 

10. Solve 32 — 2= x-+ 7 (mod 11). How many solutions are there? 
11. Do the procedures for solving equations apply to solving congruences? 
12. Does x? + 2. - 1 = 0 (mod 3) have a solution? Does x? + 22 + 2= 0 


(mod 3)? 
13. Solve the following simultaneous congruences. 
stop y= 1; 
(mod 5) 
e— y= 2. 
14. Solve 
g-- 2y = 6, 
(mod 7) 
22 — y= 1. 
15. Why does 
32 — y= 1, 
(mod 5) 
4x -+ 2y = 1, 


have no solution? 

16. The field of integers with congruence mod ™m is called the field of integers 
mod m. What numbers in the field of integers mod 3 are perfect squares (Le., 
squares of elements of the field)? 

17. Answer the same question for the integers mod 7; mod 11; mod 13; and 
mod 17. 

18. A system that satisfies all the requirements for a field, except for the 
deletion of the requirement of the existence of multiplicative inverses and the 
addition of the cancellation law, ac = be — a = ὃ, is called an integral domain. 
Give a familiar example. ΩΝ 

19. Argue that any field is an integral domain. 

20. An ordered field is a field with a subset (called the positives) closed under 
the two operations and such that any member of the field belongs to this subset, 
or else is zero, or else its additive inverse belongs. State this definition using 
logical symbols. Are the reals an ordered field? are the rationals? How is this 
definition tied in with the concept of “less than”? 

21. Show that the integers mod m (prime) are not an ordered field. 

22. Show that the field 0, 1, 2,...,” — 1 defined in the section is not an 
ordered field. 

23. Show that the complex numbers are not an ordered field. 

24. Show that the set of all numbers formed from ~/2 by the operations +, 
—,-, and + form a field. 
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ANSWERS TO EXERCISES 


(a) Rationals, reals, complex numbers. (b) Yes, since all the identities were 
derived from the axioms alone. Hence we could prove them for any field with 
the same proofs. (0) For example, 2-+ 2 = 4. Subtracting 3, we find 1 as in 
the table. (d) Write tables and verify axioms. (6) 2+ 2 = 0 in contradic- 
tion with (2-8-5). (Ὁ From the definition of the operations it is immediate. 
(g) Otherwise we would have a- ὃ = Ὁ with neither a nor ὃ zero, in contradic- 
tion to (2-8-5). (8) Since a? — 2b? is then zero and the inverse is undefined. 
(i) (at bV2)(e + ἀν 2) = (ac + 2bd) + (ad + be)V2. (7) Impossible since 
/2-is not rational. (k) Verify axioms and find inverses. (1) Multiplicative 
inverse fails. (m) Multiplicative inverse fails, since we get zero denominators. 
(n) Yes, since all the derived properties were proved from the axioms and 
definitions. (0) 2/3 = 2; 3/2 does not exist; same for other two. (p) See 
Exercise (n). 


ANSWERS TO PROBLEMS 


1. Closure is the only nonobvious property and can be shown by calculating 
a typical product. The essential thing is that the components of a sum, difference, 
product, and quotient are rational. 3. No, because we require the elements 0 
and 1. Possibly, since we might have just one element with 0 = 1. Fields with 
less than two elements are called trivial. 5. No. Multiplicative inverse fails. 
7. It has the tables (1) and (2). By use of the definition of congruence, each 
axiom can be verified. 9. α is congruent to ὃ mod m if and only if ὃ — ais divisi- 
ble by m. If m is not prime, we have the difficulty indicated in Exercise (g). 
11. Yes. The proofs require some modification, since the Rule of Replacement 
does not apply to all equivalence relations, but the properties (7) through (9) 
together with the axioms are sufficient. 

13. (2,0). 15. If the first equation be multiplied by —2, it becomes 4x -|- 
2y = 2, which is inconsistent with the second equation. 17. 1, 4, 2; 1, 4, 9, 5, 
3; 1, 4, 9, 3, 12, 10; 1, 4, 9, 16, 8, 2, 15, 18. 19. The cancellation law holds in 
any field; hence a field has all the integral domain properties. 20. See Section 
3-5. 21. Zero cannot be in the positive subset, but if we add 1 to itself m 
times, we get zero. Hence 1 cannot be in the subset. Then —1 must be in the 
subset, and (—1)(—1) = 1 must be in the subset, which is a contradiction. 
23. Since in any ordered field the square of any nonzero element is positive (the 
proof of 3-5-21 applies), and since 77 = —1 is not positive. 


10-3 Groups. The axioms in Section I-16 and the various theorems 
derived from them suggest that the operations of addition and multiplica- 
tion are similar in many ways. In Section 1-9 we have paired laws to 
bring out the analogy. Suppose we were to put * in place of both the plus. 
sign and the dot in (1-16-2) and (1-16-38). The result would be the same, 
namely a * b = 6 * a. Suppose, in addition, we put 6 in place of both 
0 and 1 in (1-16-7) and (1-16-8). In either case we find a * e = a. 
Finally, suppose we put a’ for both —a and 1/a in (1-16—-9) and (1-16-10). 
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We get α τ a’ =e. These observations suggest the possibility of con- 
structing an abstract theory about an unspecified operation of which the 
familiar addition and multiplication would be special cases. 

We take as undefined a set S, a binary operation *, and an element of 
eof S. We are going to require that S be closed under *, that * be asso- 
ciative, that e have the familiar property of an identity element, and that 
every element have an inverse with respect to +. Such a system {S, *, δὲ 
we call a group. If, in addition, * is commutative, we call the system a 
commutative group. 


Which of the following are groups? (a) The integers under addition. 
(Ὁ) The integers under multiplication. (c) The rationals under division. 
- (d) The nonzero real numbers under division. 


The defining axioms of a group may be stated as follows: 


(1) For every a and ὃ in 5, a Ἐ ὃ is an element of S. (Closure) 
For every a, ὃ, c in S, a* (ὃ * c) = (a* b) ες. (Asso- 
(2) ee 
clativity) 
3) There is an element of S (call it 6) such that for every member 
aofS,a*e=e*a= a. (Identity element) 
(4) For every element a of S there ts an element (call τέ a’) such 


thata x a’ = a’ « a = 6. (Inverse) 


(6) Replace words in (1) through (4) by quantification symbols. (ἢ) In 
the above formulation, what are the undefined constants, variables, and for- 
mulas? (g) Sometimes in defining a group it is specified that a * ὁ is “unique,” 
meaning that for any value of a and 6 in S there is just one value of a * 6. Why 
is this implicit in the above definition? 


To see a very simple example of a noncommutative group, let S be the 
set of all the operations involved in dressing and undressing. Let “a Ἐ b” 
-mean the operation of performing a and then ὃ, let e be the operation of 
doing nothing, and let a’ be the operation of undoing a (i.e., putting on 
if a was taking off something, and taking off if a was putting on). Then 
the axioms are easily seen to be satisfied. For example, (4) holds since 
putting something on and taking it off again (or the reverse) is the same 
as doing nothing. But putting on socks and then shoes is hardly the 
same as putting on shoes and then socks! 


(h) Construct another example of a noncommutative group. (1) Suppose 
we make the definition 


(5) Def. acb=arxb’. 
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To what operation does » correspond if * is multiplication? addition? (j) Addi- 
tion mod m is the process (described in Section 10-2) of adding in the usual way 
and subtracting multiples of m until a number from the set {0,1,...,m — 1} 
is obtained. Similar meaning is assigned to other operations mod m. Show that 
{0,1,...,m — 1} form a group under addition mod m for any integer m. 
(k) Under what condition does {1,2,...,m} form a group under multiplica- 
tion mod m? ([) In what sense is a clock a device for addition mod 12? 


Some very interesting groups result from the study of the symmetries 
of geometric figures, and these groups have applications in chemistry, 
physics, crystallography, and art. Consider, for example, the square pic- 
tured in Fig. 10-1. The figure is obviously symmetric about the dotted 


Figure 10-1 


lines A, B, H, and V. One way of expressing this fact is to say that the 
figure can be rotated about these lines (up out of the plane and back into 
it) through an angle of 180° without changing its appearance. Or we may 
think of the figure as being reflected in one of the lines without altering 
its appearance. The figure is also symmetric about the point O. More- 
over, this central symmetry has the special feature that if we rotate the 
square in the plane about O through an angle of 7/2, 7, 37/2, and of 
course 27, the appearance of the figure remains unchanged. 


(m) Check the definitions of symmetry in Section 5-15. 


To analyze these symmetries more carefully, we label the vertices of 
the square as indicated in the figure. We imagine the point O and the 
lines A, B, V, and H as remaining fixed. We let the letters A, B, V, and 
H serve also to name the action of reflecting the figure in the corresponding 
lines (or rotating it through 180° about these lines). We let R1, Re, and 
3 be the rotations of the figure about O through the indicated angles. 
Finally, we introduce 7 to represent the action of no motion at all. Now 
we let X * Y mean the motion that achieves the same result as X followed 
by Y. 
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FIGuRE 10-2 
d a a b d c 
Cc b d ς a b 
Figure 10-3 


For example, R; * R, = Ro, since a rotation of 90° followed by an- 
other of 90° is the same as a rotation of 180°. Similarly, R; * Ro = 15, 
and so on. We see also that A * A = TJ, since reflecting twice about the 
same line re-establishes the status quo. It is easy to calculate these “prod- 
ucts,” but for a case such as A * B or R3 * H, it is convenient to keep 
track of what happens by drawing diagrams. [or example, to find A * B, 
we first draw the square in the original position, then as it appears after 
reflection in the line A, and finally as it appears after reflecting this square 
in line B. (See Fig. 10-2.) Note that the axes do not move, and that B 
means reflection in the fixed line B of the square in whatever: position 
the square is as the result of previous operations. The final position 
shows that A * B= R,!. Similarly, from Fig. 10-3 we see that 
R 3 * WH = 7: 


Draw the squares and find: (n) B+ A, (ο) H * Rs. 


For reasons that will soon appear, we are interested at the moment 
particularly in the rotations. The reader can easily verify the following 
“multiplication table” of the rotations of the square. 


(6) R, Ry, Re Rs I 


It is not hard to see that these rotations form a group, with J the iden- 
tity element. 
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(p) How does the table reflect that J has the property (3)? (q) How is 
property (4) shown? (r) How is closure indicated? (86) How could you 
check (2) from the table? (0) Can you think of an easier way to demonstrate 
associativity by an argument? | 


Now we consider the group of integers {0, 1, 2, 3} under addition mod 4. 
We have the following table. 


Ἐπ 1 2. 
0 01 2 8 
(7) 1 1 2 3 0 
22 3 0 1 
ὃ 8 01 2 


If the reader compares (7) with (6) term for term, he will find that they 
have the same structure. Indeed, the correspondence {(0, J), (1, 21), 
(2, Re), (3, Rg)} carries (7) into (6). This correspondence is one to one. 
Calling it F, it is evident that F(a + ὃ) = F(a) ε (δ) for any a and ὃ 
in {0,1, 2,3}. Hence the two sets {0,1, 2,3} and {J, Ry, Re, Rs} are 
isomorphic with respect to the relation of equality and the operations + 
and *. When two groups are isomorphic with respect to the group opera- 
tions, we say simply that they are zsomorphic groups. Isomorphisms are 
of considerable use. If we know a property of one isomorphic group, we 
know immediately that it applies to the other. In this example, we can 
answer any question about the rotations of a square by reference to 
addition mod 4! 

(u) In Fig. 10-4 we show an equilateral triangle with the indicated symme- 
tries. Make a table of the group of rotations 1, Ri, Re. (v) Show that this 
group is isomorphic to the group of integers under addition mod 3. (w) Show 
by writing down the multiplication table that the four fourth roots of 1, {1, ὁ, 
--1, —t} form a group under multiplication. (x) Is this group isomorphic to 
the groups in (6) and (7) (y) What can you say about the square roots of 1 
under multiplication? 


No 


L ~ 


A 
Figure 10-4 
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PROBLEMS 


1. Complete the multiplication table of the group of symmetries of the 
square. 

2. Complete the table for the group of symmetries of the equilateral triangle. 

3. Show that the cube roots of 1 under multiplication is a group isomorphic 
to the groups in Exercises (u) and (v). 

4. Can you suggest an underlying reason for the isomorphisms indicated in 
Problem 3? 


In Problems 5 through 32 show that the set under the given operation is a 
group (cite the identity element and inverse) or show that it is not a group and 
indicate why. 


5. The even numbers under addition. 

6. The odd numbers under multiplication. 

7. The odd numbers under addition. 

8. The even numbers under multiplication. 

9. Numbers of the form 2” under multiplication, where n may be any integer. 

10. Numbers of the form 10” under multiplication, where 7 is any real number. 

11. Numbers of the form 10” under addition. 

12. The integers under subtraction. 

13. The positive reals under multiplication. 

14. The negative reals under multiplication. 

15. The nonconstant linear functions under composition. 

16. The quadratic functions under composition. 

17. The exponential functions under composition. 

18. The exponential functions under multiplication. 

19. The power functions 17 with domain the positive reals and with exponent 
any rational number, under composition. 

20. The debit and credit entries in a charge account, where a * 6 means the 
entry that has the same effect as a followed by ὁ. 

21. The set of all valid contracts between two individuals, where a * 6 means 
the contract equivalent to contract a followed by contract ὃ. 

22. Numbers of the form n/*/2, where n is an integer, under addition. 

23. The five fifth roots of 1 under multiplication. 

24, Numbers of the form 1/5”, where n is an integer, under addition. 

25. The power set of a given set under the operation of union. 

26. Operations of the form Ὁ" (where n is any integer, D°f = f, where Ὁ 
is the derivative, D~! is the antiderivative, and these operators all have as 
domain and range the set of all functions that can be differentiated any number 
of times) under composition. 

27. Sentences under the operation of conjunction; disjunction. 

28. Sentences under the operation p * g = (p V τῶν (q V ~p). 

29. Two-place decimals under multiplication, where the product is obtained 
in the usual way followed by rounding off to two places. 

30. The odd functions under multiplication. 

31. Plane vectors under the dot product. 
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Figure 10-5 


FicguRE 10-6 


32. The motions that leave the pattern of Fig. 10-5 unchanged. (We imagine 
the pattern continuing indefinitely in either direction.) 


33. Show that the group of positive real numbers under multiplication is 
isomorphic to the group of real numbers under addition. (Hint: This isomor- 
phism is the reason that logarithms are so useful!) 

34. Sketch an isosceles triangle and write the multiplication table of its sym- 
metries. To what group of integers under addition is this symmetry group 
isomorophic? 

35. Answer the same question for the rectangle sketched in Fig. 10-6. 

36. Show that the group of symmetries of an ellipse or hyperbola is isomorphic 
to that of a rectangle (except when the ellipse is a circle). 

37. Construct and Justify a similar statement for the symmetries of a parabola. 
38. What is the group of symmetries of a figure that has “no” symmetries? 
*39. Sketch a regular pentagon and write the multiplication table of its sym- 

metries. 

40. Describe the group of symmetries of a regular polygon of an even number 
of sides. 

41. Do the same for one with an odd number of sides. 

42. Suppose we have a certain geometric figure G. We call a movement of 
the figure that leaves its appearance unchanged a symmetry. Let G be the set 
of all symmetries of a figure, and let a * ὃ be the movement equivalent to a 
followed by 6. Show that the symmetries form a group. 

43. Show that the n nth roots of 1 form a group under multiplication. To 
what group of symmetries is this group isomorphic? To what group of integers 
under addition is it isomorphic? 

44. Describe the group of all symmetries of a circle. To what group of numbers 
under addition is the group of rotations isomorphic? 

45. Consider the set of all angles under the operation of addition and with the 
relation of congruence as defined in Section 4-9. Do they form a group? What 
is the relation between congruence in the geometric sense of Section 4-9 and 
congruence modulus some number m? 
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*46. A pattern such as that of Fig. 10-5 laid out on an infinite straight line 18 
called a border. Any border has a characteristic group of motions that leave it 
unchanged in appearance. Construct some borders and characterize their 
groups. 

*47. A pattern that covers the entire plane in some repetitive fashion is called 
a tesselation. (Imagine an infinitely large wall covered with wallpaper or an 
infinite floor inlaid with some pattern of tiles.) Each tesselation has a charac- 
teristic group of motions that leave its appearance unchanged. Draw some 
tesselations and investigate their groups. 

*% 48. Solid figures have symmetries that may be studied as we have those of 
plane figures above. Write the multiplication table of the symmetries of a cube. 
Investigate other solid figures. (Hint: Consider reflections in a plane of sym- 
metry, reflections in a point of symmetry, and rotations about a line of sym- 
metry.) 

* 49. Consider the set of all possible colors and let xa * yb be the color resulting 
from mixing z parts of a and y parts of ὃ. Do the colors form a group under *? 
%* 50. Consider the substitutions that can be made for variables in formulas. 
Do they form a group? 

*%51. Do the productive operations of a factory form a group? 

#52. Leta +b = a+ ὃ — ab, for a and ὃ real numbers. Is this a group? 


ANSWERS TO EXERCISES 


(a) Yes. (Ὁ) No, inverse fails. (ὁ) No, since 0 has no inverse. (d) Yes. 
(0) VaVbaEeESAbES A a*xbES;VavbVeaESASbESAcCES— 
a*x(b*c) = (ᾳ δ), Ἐε;Ξ ESA VaaeCS >~a*2x=2%*a = a(then we 
can use 2-10-9 to name this element δ); να ας Καὶ -ὁ᾿ 3Ξχα Ἐπὴ τ τ τὰ = 6 (again 
we use 2-10-9 to name this element a’). (f) Constants are “S,” “x,” and “e”; 
variables are small Latin letters, and formula is “a * ὃ. (6) In our treatment 
we implicitly took “a * δ" as an undefined formula. But by definition a for- 
mula has only one value for each set of values of its variables. (i) Division; 
subtraction. (j) Verify axioms. (k) mmustbe prime. (1) Two times that 
differ by a multiple of 12 yield the same clock position. 


d a d ὃ b c 

ς ὃ ΠΝ b " d 
Figure 10-7 

b a 
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Figure 10-8 
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(n) Fig. 10-7. (0) Fig. 10-8. (p) The row and column of J repeat the 
borders. (q) 1 appears once in each row and column. (r) All entries are 
members of the set. (5) By trying all cases, all 24 of them! (t) Since in 
this case a * (ὃ * c) means the result of the motion a followed by the result of 
the motion b followed by ¢, and (a * δ) * c means the result of motion a followed - 
by ὃ and then followed by c, each mean the result of performing a, ὃ, and ὁ in 
that order. (u) See Exercise (v).  (v) Under the correspondence {(0, J), 
(1, Ri), (2, Re)}. (w) See Exercise (x). (x) Under the correspondence 
(0, 1), G, 2), (2, —1), (8, —7)}. (5) Group isomorphic to the integers mod 2. 


ANSWERS TO PROBLEMS 


ie * I Ry Re R3 H V A B 
I Ry Re R3 H V A B 
Ry Ry Roe Rs I. A B V H 
Ro Rp ἢ I Rk V HH BA 
13 Kz I Ri Ro B A H VY 
H H B V A Ι Re R3 Ry 
V V A H B Ro 1 Ry [3 
Α A Η 8 V Ry 13 I Ro 
B B V A H R3 Ry Re I 


3. Show that if w = (—1-+ V37)/2, then the three cube roots of 1 are w, w?, 
and w* = 1. Then make the table. 5. Yes, 0 is identity, and —a is inverse. 
7. No, not closed. 9. Yes, 2° is identity, 2—” is inverse. 11. No, not closed. 
13. Yes, 0, 1/z. 15. Yes, J, (1 — b)/m is inverse of mI + ὃ. 17. No, not closed. 
19. Yes, 19, I is inverse of 1, and 11 is inverse of I" for r # 0. 21. No, inverse 
fails. 23. Yes, isomorphic to addition mod 5 of integers. 25. No, inverse fails. 
27. No, inverse fails. 29. No, identity not unique. 31. No, closure fails. 33. A 
correspondence is given by 10], οἷ, or any other exponential with positive base. 
35. None. 37. Integers mod 2. 41. There are n — 1 rotations through multi- 
ples of 27/n and n reflections through lines Joining vertices to midpoints of the 
opposite sides. 48. Rotational symmetries of regular n-sided polygon. Integers 
mod n. 45. Yes. Congruence of angles is isomorphic to congruence of numbers 
mod 2π. 


10-4 Transformation groups. Since composition of functions is assocja- 
tive by (5-9-6), it seems possible that some sets of functions might be 
groups under this operation. Indeed, we see from (5-9-8) that the iden- 
tity function Z4 might play the role of the group identity, and from 
(5-10-19) and (5-10-20), that the inverse of a function might play the 
role of the group inverse. However, for this to work out it is evidently 
necessary that the set of functions have inverses (1.e., that their converses 
be functions) and that a fixed set serve as both range and domain for all 
of them. 7 
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(a) Explain why this is necessary in terms of (5-9-8), (5-10-19), and 
(5-10-20). 


Suppose that we have a set of one-to-one correspondences (functions, 
transformations) that map a set A onto itself (i.e., A is both domain and 
range of each function) and such that the set is closed under composition, 
contains 14, and contains the inverse of each of its members. Then the 
set is a group under composition, with J4 the identity element and 7? 
the group inverse of 7. We write J for [4 and T {7 for T; ο Tz unless 
ambiguity seems likely. Note that 7,7. is the operation of applying 7; 
and then Τὰ, that is, (717'2)(z) = T2(T1(x)). So as to always be working 
from left to right, we write +7’; 7'2 for the image of x under 7 9 Tn | 

As a simple example of a group of transformations, consider the linear 
functions of the form J + a and let Τὰ = J + a with domain and range 
the real numbers. The identity is J on the reals, the inverse of 7 is Τ᾽ α, 
and T4T, = Tai». The geometric interpretation in terms of an axis 15 
interesting. We may think of Τὰ as translating each point of the axis a 
directed distance a. (We could alternatively think of 7, as moving the 
origin a directed distance —a.) Then 7 is the act of translation through 


O, T_a, reverses T,, and T,4» is a translation equivalent to translating | 


through a and then through ὃ. 


(b) Show that the group of translations 7’, is isomorphic to the real numbers 
under addition. (ὁ) Show that the distance between two points on an axis 
is invariant (unchanged) under the group of translations; 1.e., show that 
[1] — χα] = Τὰ -- xoT a. 


The above example is easily generalized to the plane. Let 74, be the 
transformation that carries (x, y) into (x + a, y + ὃ), that is, (x, y)Tas = 
(x + a,y +b). Let the domain and range be Re X Re. Then the set of 
all such translations is a group. The geometric interpretation is that T'a,. 
moves all points of the plane through the vector (a, ὃ). 


(4) Show that (2, y)Ta. = (ὦ, y)+(a,6). (0) Find Deal ea (f) Is 
this group commutative? (g) What is the inverse of 7,» the identity ele- 
ment? (h) To what group of numbers is this group of translations isomorphic? 
(i) To what group of transformations of coordinates is it isomorphic? 


Because of the convenience of geometric imagery, we often speak of the 
domain and range of a transformation as a “space” and of its members 
as “points,” even when geometry is not involved. 

The above examples referred to spaces with an infinity of points. By way 
of contrast consider the set of all one-to-one transformations of {a, b,c, d} 
onto itself. The identity is J = {(a, a), (0, 6), (ο, ©), (ἃ, d)} and another 
transformation is {(a, Ὁ), (ὃ, c), (6, 4), (d,a)}. A more convenient nota- 
tion for such transformations is to list the points on one line and the cor- 
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responding images below them. The transformations just named are 
given by ἢ 


abe α ὃ ς ἃ 
(1) Chea a ἢ c ἃ al" 


To find the product of two transformations we simply trace each point. 
Thus we have 


2) PO εν (Εἰ 


since on the left, working from left to right, a goes into ὃ and then b goes 
into a, so that the net result is a into a. Similarly, ὃ goes into c and then 
c goes into d, so that the net result is ὃ into d. 


(1) Check the other entries in (2). (k) Calculate 
( boc ΟἽ & boc °) 
δ dac/\d ¢ ὃ as 


(° δ ¢ 4 ({ ὃ ) 
c a ἃ b/ \b ἃ a cs’ 

One-to-one transformations on finite sets are often called permutations, 
and the corresponding groups are called permutation groups. 


(m) What is the connection with the use of this term in Section 8-3? (n) Con- 
jecture a group of permutations isomorphic to the symmetries of the equilateral 
triangle and justify your result. 


() Find 


o 
Qu 


The isomorphism between the group of permutations on three objects 
and the symmetries of the equilateral triangle is not accidental. Whenever 
we move the triangle by one of its symmetries we permute the vertices 
in some way, and conversely. Indeed the symmetries of the triangle is 
just a geometric interpretation of the corresponding permutation group. 
The group of all permutations on n objects is known as the symmetric 
group of degree n. 


(0) Show that the permutations on ἔα, b,c,d} and the permutations on 
{1, 2, 3, 4} are isomorphic. How does this justify the use of the name “the sym- 
metric group”? (p) Is the symmetric group of degree four isomorphic to the 
symmetries of the square? (4) How many members does the symmetric group 
of degree n have? (Ὁ) What property of the regular n-sided polygon remains 
invariant under all transformations of the symmetric group applied to its 
vertices? (5) under those of the group of symmetries of the regular n-sided 
polygon? (t) A formula is said to by symmetric with respect to a set of vari- 
ables if it is invariant under the symmetric group on these variables. List 
several formulas defining polynomials that are invariant under the symmetric 
group on {a, ὃ, ε, d}. 
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We have seen that transformation groups seem to be associated 
with properties that remain invariant. This is no accident. Let 


f(zi, X2,..., 2%) be a formula depending on the n variables indicated. 
Consider the set of all one-to-one transformations mapping (x1, 2, ..., Xn) 
into (x{, 23, ..-<, 22) so that f(r, t2,...,%n) = (1, %g,..., 2,) for all 


values of the variables, i.e., so that the expression remains invariant. 
Then the set of transformations is a group under composition. 


(u) Prove this. 


This idea has been used to classify geometries of different types. Re- 
call that in Euclidean geometry we are permitted to move figures about 
(transformations) provided their shape or size is not distorted, 1.6., pro- 
vided distances remain invariant. Such motions are required, for example, 
in proving congruence theorems by superposition. The set of all such 
motions form a group. What are they and what is the nature of this group? 
They are the translations (considered at the beginning of this section), 
the rotations (see Section 5-17), and the reflections in any line. For ex- 
ample, to prove the congruence of the two triangles ABC and A’B’C’ 
in Fig. 10-9, we need to be able to “flip one of them over,” i.e., to reflect 
A’B'C’ in the line L to get A’’B’'C’’. In Euclidean geometry these two 
triangles are congruent (i.e., can be superposed), but this is only possible 
if reflection is permitted. The translations, rotations, and reflections are 
called the rigid motions, and their group is sometimes called the Huclidean 


group. 


FIGURE 10-9 


There are many types of geometry, in each of which different types of 
transformations are allowed. The most general geometry so far considered 
by mathematicians is the geometry that treats properties invariant under 
any continuous one-to-one transformations with continuous inverses. 
Geometrically, this means that we can do any thing we wish with the plane 
in the way of stretching, moving, or distorting, except that we must not 
cut or tear. This geometry, which studies the properties invariant under 
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the group of all one-to-one continuous transformations 15 called topology. 
In topology, a square and a circle are “congruent”! In the problems we 


consider these and other transformation groups in further detail. 


(v) Why does the continuity of the transformations preclude tearing or cut- 
ting? (w) Why would a square and a circle be “congruent”? (x) Draw a 
figure not congruent to a circle in plane topology. 


PROBLEMS 


1. In Section 5-17 we derived equations (5-17-8) giving the old coordinates 
of a point in terms of the new coordinates when the axes are rotated through an 
angle @. Here we wish the formulas giving the new coordinates in terms of the 
old when the plane (axes fixed) is rotated around the origin through an angle ¢. 
Show that the required formulas for this transformation Τῷ are 


z = xcosd@ — ysin 9g, 


(3) 


y = xsing + ycos¢. 

2. Letting 74 be the rotation of the plane about the origin, what is To? 

3. Find 747» by noting that (z,y)T, = (cxcosd — ysing, xsing + 
y COS ). | 

4. Find the inverse of Τῷ by “looking at the geometry.” 

5. Find the inverse by algebra from the requirement that Tr; T, = γι. 

6. Both translations and rotations leave distance invariant. What addi- 
tional invariance is related to translation? to rotation? 

7. Consider the group of all translations and rotations where Ὑ 1 α,ν0 means 
a rotation through @ followed by a translation through (a, ὃ). Is Τα, the 
same transformation? What is the identity element? 

8. Find the equations defining a reflection about the z-axis; the y-axis. 

9. Do the same for any line through the origin. 

10. Does the set of all reflections in all lines of the plane form a group? 

11. What properties of the conics remain invariant under rigid motions? 
How does this appear algebraically? 

12. A transformation 7 for which xT = mzx-+ ὃ 15 called a linear transforma- 
tion. What is the geometric interpretation in terms of points on a line when 
m > tiandb = 

13. When m > 1 and ὃ ¥ 0? 

14. When0 « m < landdb # 0? 

15. When m < 0? 

16. In general? 

17. Show that the linear transformations form a group. 

18. Can you think of anything that remains invariant under this group? 

19. Make a multiplication table of the symmetric group on two objects. 


Let 
fl 2 1 2 - 
Gai) a=(5 j): | 


10-4] TRANSFORMATION GROUPS 567 


20. What group of symmetries of a geometric figure is isomorphic to this 
group? 

21. Write down the permutations on the vertices {a, ὃ, ὁ, d} that result from 
each symmetry of the rectangle (Fig. 10-6). Treating them as permutations, 
find their multiplication table. 

22. Do the same for the permutations of the vertices of the square correspond- 
ing to the symmetries. 7 

23. Write the multiplication table of the symmetric group of degree 3. 

*24. We have shown that vectors in the plane form a group under addition. 
Could you generalize this by defining vectors in space of three dimensions in 
terms of three components? 


425. Let (αι, 22,...,2n) be an n-dimensional vector. Define addition so 
that all such vectors with real components form a group. Indicate some sub- 
groups. 


+26. The linear (homogeneous) transformations in the plane are those such that 
(x, y)T = (ax + by, cx - dy). Those for which ad — be ¥ 0 are called non- 
singular. Show that the set of all nonsingular linear transformations forms a 
group. 

*27. An affine transformation in the plane is one for which (2,y)T = 
(az + by + m, cx + dy -+ n). Show that an affine transformation is the com- 
position of a linear homogeneous transformation with a translation in that order. 
Show that the nonsingular affine transformations form a group. (The cor- 
responding geometry is called affine geometry. The thing that remains invariant 
is the parallelism of lines, i.e., two parallel lines remain parallel after any such 
transformation.) 

428. Construct a multiplication table for the symmetric group of degree 4. 


ANSWERS TO EXERCISES 


(a) Otherwise the identity would not be unique, and the composite might 
not always be defined. (Ὁ) This was the essential idea behind the discussion 
in Sections 1-2 and 1-4. (0) Replace 2;7'. by the equivalent x;-+ a. (ἃ) See 
Section 4-3. (e) Tata,b+a (ἢ Yes. (ἡ T-«,-s.  (h) The complex 
numbers under addition. (i) The translation of the axes, where the transla- 
tion of the origin to (—a, —b) corresponds to Τὰ... (k) The second line is 
(cadb). (1) The identity. (m) There is a one-to-one correspondence between 
arrangements of the letters (permutations in the sense of Section 8-3) and trans- 
formations (permutations in the present sense). Indeed, the corresponding ar- 
rangement is just that of the second line in the symbol for the transformation. 
(n) The permutations on {a, ὃ, εἰ. See the paragraph that follows the exercise. 

(o) Just a change of names. “The” is justified, because all of them are iso- 
morphic, i.e., of the same structure. (p) No. There are only 8 symmetries, 
but 24 permutations. (q) n! (r) Its appearance. (5) Distance between 
vertices. (t) abcd; abe + abd + acd + bed; ab-+ ac + ad - be + bd Ἢ ed; a+ 
b-+-c-+-d. (These are called the elementary symmetric polynomials.) (u) If 
f is invariant under ΤΊ and under 7.2, it will certainly be so under 7172. Hence 
the set is closed. The identity transformation does not change f, so belongs. The 
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inverse of a transformation that leaves f invariant will certainly leave f in- 
variant. Finally, all transformations are associative under composition. 

(v) Roughly speaking, nearby points would not then be transferred into 
nearby points, which 15 essential for continuity. (See Section 7-3, which you will 
need to generalize from one to two dimensions. The essential thing is that the 
continuity of Px means that as x — y, F(x) — F(y), i.e., when = is close to y, 
F(x) is close to F(y). In other words, nearby points go into nearby points.) 
(w) Since one can be superposed on the other by distortion short of cutting. 
(x) A figure eight. 


ANSWERS TO PROBLEMS 


1. To think of the points instead of the axes moving, reverses the relation 
between old and new. 3. 1φ 19 = Tose. 5. Tyie = To — 8 = —h+ 2kx. 
But Τὰ γι. = Ta. 7. No. Draw a figure and trace a point through a transla- 
tion followed by a rotation, and through the same rotation following by the 
translation. 9. Suppose that the line has equation y = (tan@)z. Rotate plane 
through —@ to bring line on z-axis, reflect in x-axis, rotate back through @. This 
yields, x” = x cos 26+ ysin 26, y’ = xsin 20 — ycos20. 10. No. 11. All 
properties of curve itself. Invariance of certain functions of the coefficients of 
the equations. See Section 5-17. 13. Stretches the line and shifts it a directed 
distance ὃ. 15. As before, but reverses directions. 17. Check axioms. 19. Iso- 
morphic to addition mod 2. 21. Should be isomorphic to symmetries of the 
square. 23. Should be same as that of symmetries of equilateral triangle if you 
use the same labels. 


10-5 Group theory. The last two sections defined the group concept 
and illustrated it in various ways. Group theory itself is concerned with 
developing laws that apply to abstract groups, independent of such il- 
lustrations. The theory is built solely upon the axioms (10-3-1) through 
(10-3—4) and well-chosen definitions. It is much too vast to be even 
sketched here. One could easily spend a lifetime exploring and adding to 
it. Our purpose here is merely to hint at this great structure by a few 
examples. | 

First of all, it is obvious that any law that holds for addition of real 
numbers or for multiplication of nonzero real numbers, where just one of 
these operations is involved, must hold for any commutative group. 


(a) Why? (b) Why nonzero real numbers? (c) Why commutative 
group? (d) Generalize by referring to any field instead of to just the real 
numbers. 


Furthermore, any law of this kind whose proof does not require the 
commutative law, will hold in any group. Of course, it is naturally most 
interesting to derive laws that hold in any group to replace laws that were 
derived for commutative groups but do not hold more generally. For ex- 
ample, for real numbers (and in any field) we have —(a + b) = —a + 
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(—b) and 1/ab = (1/a)(1/b). The corresponding law for commutative 
groups is (a * b)’ = a’ τ b’, but this does not hold for all groups. How- 
ever, the following law does hold in any group. 


(1) (a * Ὁ)" = δ' « α΄. 


The proof is not difficult: Using associativity and the property of the in- 
verse, we have (ab)(b’a’) = a(bb’)a’ = (ae)a’ = aa’ = e. Since the in- 
verse of an element is the only element whose product with the given 
element yields e, we must have (ab)’ = δ΄ α΄. 

Of course we have not yet proved here that there is one and only one 
element x such that ax = e, that is, 


(2) να 3'χ α Ἐκ =e. 


But this is not difficult. For suppose that ἃ Ἐκ =e anda*y.=e. 
Then a*x—=ax*y. Hence a’ * (a* 2) =a’ * (a*y) err τῷἨ 6 ἘΨ, 
and «x = y. Hence there can be at most one x such that a * x = 6. But 
axiom (10-3-4) guarantees that there is at least one. 


(e) Prove (3). (ἢ Prove (4). (g) Prove (6) and (7). 


(3) Vasdlax*ea =e. 
(4) dizeVax*a= a. 
(5) 1]. γα α τ x= a. 
(6) (α Ἐκ τ α Ἐ ψ) > (r= ¥). 


(7) (x*a=y*a)—- (r= τ). 


(h) Why do we need two such cancellation laws? (They are called the left and 
right cancellation laws). 


We spoke above of “well-chosen definitions.” As an example, we in- 
troduce the notion of subgroup, upon which rests many of the most fas- 
cinating results of group theory. If S is a subset of G and if S and G are 
groups with respect to the same operation, then S is called a subgroup of G. 


(i) Show that every group is a subgroup of itself. (ἡ) Show that any non- 
trivial group has at least one other subgroup. (k) What element of a group 
belongs to every subgroup? (1) Mention several subgroups of the group of 
integers under addition. (m) Find all the subgroups of the group of sym- 
metries of the equilateral triangle. (n) Show that if a group is commutative, 


so must be all of its subgroups. Go: whats we Ἵν τ 


οι δῖε 
πὰ A ay 
ES 
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One of the most remarkable elementary results in group theory is 
Lagrange’s theorem: 


The number of members of any subgroup of a finite group is a 
divisor of the number of members in the group. 


(8) 


(0) What conclusion can you draw about groups with a prime number of 
members? (p) Verify (8) in the situation of Exercise (m). 


To prove (8), let us consider any subgroup S of a group G. Let Sa be 
the set of elements of G obtained by multiplying each member of S on 
the right by a, where a © G. Of course, if a Ε δ, then Sa = S, but 
otherwise Sa is not a group. 


(q) Why not? 


For any a, Sa has exactly as many members as S. For obviously 1t can- 
not have more members, since each of its elements is obtained from one of 
S by multiplication by a. It could have less only if b* a = c * a for 
two different elements 6 and c in S. But this is impossible because of (6). 

Also, for any a and ὃ we have either Sa = Sb or else Sa // Sb, that is, 
two different sets of this kind are always disjoint. For suppose that Sa 
and Sb have a common element c, so that ὁ = ma = nb. Then ior any 
element sa of Sa, sa = sm’ma = sm’nb. That is, any member of Sa is 
also a member of Sb. Hence if Sa and Sd are not disjoint, they are identical. 

Finally, we note that the union of all the sets of the form Sa, for a run- 
ning over the whole membership of G, is just G itself, 1.e., every member 
of G belongs to one of these sets, for the element g of G clearly belongs to 
Sg, since e belongs to S and eg = g. 

What we have shown is that the set G can be partitioned into subsets 
each of which has the same number of members as S. It follows that the 
number of members in G is a multiple of the number of members in (δ, as 
we wished to prove. 


(r) Restate the last paragraph without using the word “partitioned.” 
(s) Write a similar proof based on using sets of the form aS consisting of the 
left multiples of members of S. 


PROBLEMS 


1. Show that in any group (a’)’ = a. 

2. Generalize (10-5-1) to the inverse of the product of any number of 
elements. 

3. How can we be sure that it is impossible from the group axioms to prove 
the commutative law? 

4. Show that in any group for any a and c in the group there exists one and 
only one element 6 of the group such that a * ¢ = ὃ. 
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5. Do the same but with the final equation a * 6 = 6. 
6. What is the relation between the group of symmetries of the square and 
the symmetric group on four objects? 
7. Mention some subgroups of the group of rigid motions of the plane. 
8. Do the reflections in the plane form a subgroup of the rigid motions? 
9, Prove that in any group e’ = e. 
10. Let αἱ = a and a? = a*a. Give a recursive definition of a” for n a 
positive integer. Define αὐ and α΄" for n positive. 
11. Prove that with these definitions the laws of exponents hold. 
12. A group that consists entirely of powers of one of its elements is called 
cyclic. Cite some examples. 
13. List all the subgroups of the group of the 15 fifteenth roots of 1 under 
multiplication. 
14. State a general result of which your result in Problem 18 is a special case. 
15. Show that every group with three or fewer members must be commutative. 
16. Show that a set satisfying axioms (10-3-1), (10-3-2), and the following 
alternatives for (10~3-3) and (10-3~4) is a group: There is an element 6 such 
that δ κα = a. For any a, there is an element a’ such that α΄ * a@ = 6. 
17. Make up another weaker set of axioms similar to those given in Problem 
16 and prove that they are sufficient to define a group. 
* 18. Show that the following axioms define a group. 1. If a and ὁ are in 8, 
then a * bisin S. 2. *is associative. 3. For every a in S there is an α΄ in S such 
that for any bin S, (α τ α) κα' =b. 4 αἘδ Ξ ατὸο-πὸὺῦ - Ο. 
* 19. Make up and justify another set of axioms. 
*20. Prove that every finite group is isomorphic to some group of permuta- 
tations! (Suggestion: Let a1, a2,...,4n be the elements of the group. Then 
consider the set of permutations 41, Ag,..., 42 on these elements, where 
A; is the permutation that carries αἱ into a1a;, a2 into a2a;, a3 into a3a,..., and 
Gn IntO Anda;.) 


ANSWERS TO EXERCISES 


(a) Since such laws are derived only from the axioms for reals that are the 
same as the group axioms. (Ὁ) The reals including zero do not form a group. 
(c) Reals are commutative. (4) Substitute “any field” for “the real numbers.” 
(ὁ) Like (2). (ὃ χα = a and ya = a imply ra = ya implies x = y: Simi- 
larly for other. (g) Multiply on left by a’ in (6) and on the right by a’ in (7). 
(h) We do not have commutativity. (i) By the definition, since every set is a 
subset of itself. (j) The group consisting of only the identity element. (k) The 
identity. (1) Evens, multiples of 3, multiples of 4. (m) The rotations; the 
groups consisting of the identity and Just one reflection; all symmetries; the 
identity. (n) Since all elements of the subgroup belong to the group. (o) They 
have no subgroups but the trivial one and the group itself. (p) The divisors of 
6 are just 2and 3. (q) No identity element. (1) The sets of the form Sa are 
such that each member of Οἱ belongs to one and only one and each such set has 
exactly as many members as S. Hence.... (8) Everything goes exactly the 
same with reversal of order of multiplication. 
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ANSWERS TO PROBLEMS 


1. Since aa’ = e and by (3), there is just one z, namely (a’)’, for which 
va’ = ὁ. 2. Induction! 3. Since we have examples of groups that are not 
commutative. 5. There exists at least one, since (b:a’c) yields a true statement. 
There is at most one, since az = c and ay = cimply x = y. 7. The rotations, 
the translations, the translations parallel to the z-axis, the rotations through an 
even number of degrees, etc. 9. Since ee = e, and use (3). 11. The definitions 
and proofs are exactly the same as in Section 6-6. 13. The five roots with 
angles multiples of 72° (i.e., the five fifth roots of unity), the three roots with 
angles multiples of 120° (i.e., the three cube roots), the single number 1, the 
entire set of 15 roots. 15. Construct multiplication tables for one, two, and 
three members. 17. There is an element e such that ae = a, and for any a 
there is an element “a such that ’aa = e. 19. See “Definitions of Group In- 
volving Quasi-inverse Elements,” by Rafael Sanchez-Diaz, Proceedings of the 
American Mathematical Society, Vol. 4, 1953, pp. 424-428. 


10-6 Boolean algebra. Let B be a set of elements on which are defined 
two binary operations U (cup) and NM (cap) and a unary operation ’ (prime) 
and an equivalence relation = such that 


(1) (a,bEB) > (@UbEBAaNDEBAAd EB). 
For any a, b, andcin B 


(2) aUb=bUa, (27) anb=bna, 
(3) aU(bUc)=(@ub)Uc, (3) an(bne = (and) Ne, 
(4) an(bUc) (anb) ὦ (ano), 
(4) aU(bne = (aUbnauoco), 


i 


(5) aUvUa=a, (5’) ana=a, 

(6) (aU bsv=H=a' ned’, (67) (α Γ 0) ΞΞ α' ὦ δ΄. 
(7) (a)' =a 

Let there be δ and e;, belonging to B, such that 

(8) aUeo = a, (87) aM 60 = 60; 

(9) αι) 61 = 6, (9’) ane; =a, 

(10) aUa’ = é, (10’) α Γ α' = 60. 


Then we call the set {B, U, N, ’, eo, 61} a Boolean algebra. 
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(a) Show that the set B of all subsets of a universe U is a Boolean algebra 
under the operations of union, intersection, and complimentation. (Ὁ) Sug- 
gest names for the axioms (1) through (7). (c) Is a Boolean algebra a group 
with respect to either operation? 


In the algebra of sets we also had binary relations, in particular the 
relation of inclusion C. In any Boolean algebra we may define a similar 
binary relation, as follows: 


(11) Def. — @ Rb = [and = ep. 


(d) To what law of sets does (11) correspond? 
With this definition it is not hard to prove 


(12) eo Ra, (2) ake, 
(13) aka, 

(14) [aRb A ὁ καὶ αἱ — (a= ὃ), 

(15) [aRbA δὴ εἰ - (α καὶ ο), 

(16) (α “αὶ δ) «» (α ὦ ὃ ΞξΞ δ) «» (α Ω ὃ ΞΞ a). 


(6) To what laws of sets do (12) through (16) correspond? 


As a simple example, quite different from sets, let B = {0,1}, eg = 0, 
qa =17’=-1—2a,rUy=ut+y— ὧν, and«ny= σῷ. Then it 
is easy to verify that all the axioms hold. 


(f) Write the tables giving the values of 2’, x U y, and Ζ y for the possible 
values of the variables. (g) What is R in this case? (8) Prove that (4’) 
holds by applying the definitions of the operations to the formulas in both 
members. 

(i) Let B be a nonempty set of propositions that contains the negation of 
each of its member and conjunction and disjunction of each pair of its members. 
Let eo be a false proposition f and e; a true proposition t. Letz’ = ~z,rUy = 
ΜΓ = 2A y, and the equivalence relation be <>. Show that this is 
a Boolean algebra. (j) What is {ἢ in this algebra? (k) Why does it happen 
that sets and sentences satisfy formally identical laws? 


One of the most interesting examples of a Boolean algebra is the algebra 
of switching circuits. Let us imagine an electric circuit between two ter- 
minals 7; and Tg. Figure 10-10 shows the simplest possible circuit, 
a single wire joining the terminals with one switch. Suppose we indicate 
the state of this switch by a variable x that has the value 0 when the 
switch is open and 1 when it is closed. 
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If a circuit contains several switches, we use a different variable for 
each one. Tor example, Fig. 10-11 shows a circuit containing two switches 
in parallel. We say that the entire circuit is closed if current can flow from. 
one terminal to the other, and open otherwise. We let x U y stand for 
the state of the circuit. It takes the value 0 if both xv and y are 0 and the 
value of 1 if either one is 1. 

Figure 10-12 shows a circuit containing two eee in series. We let 
xf) y stand for the state of the circuit. Evidently «My is 1 if both x 
and y are 1, otherwise it is 0. 

Finally, we let x’ be the state opposite to x, so that if y = 2’, then y 
is open and z closed, or else y closed and x open. Such a situation exists 
when one switch is linked to another so that when one closes the other 
opens. A simple possibility is sketched in Fig. 10-13. In practice, switches 
are usually linked by relay mechanisms consisting of coils and magnets, 
or by electronic devices. 


(1) Write simple formulas for a’, a U ὃ, aN ὃ, in terms of the usual arithmetic 
of Ὁ and 1. (m) Show that the above is a Boolean algebra. (n) Do you 
think it might be possible to construct machines to solve logical problems? 
Why? 


It should be noted that = in switching algebra is the equivalence rela- 
tion. It does not mean that the circuits are the same, but merely that 
their states are the same, since the variables stand for the states of the 
circuits. Thus (6) means that a circuit consisting of two switches a and 6 
in parallel is always in the state opposite to that of a circuit consisting of 
two switches c and d in series that are respectively in the opposite state 
from the originals, i.e., for which c = a’ and d = δ΄. The two circuits 
are sketched in Fig. 10-14.. In the figure, α and ὃ are shown both open, 
which implies that a’ and b’ are closed. As is evident in the figure, the 
circuit α U b is open and the circuit a’ Ω b’ is closed. If we make any 
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change that closes the first circuit, the second circuit opens. This is the 
meaning of (6) in the present interpretation. 


(o) Draw diagrams to illustrate other laws of Boolean διβου τα as applied to 
switching circuits. (p) What is R here? 


Because the algebra of switching circuits is Boolean, this type of ab- 
stract system plays a very important role in the design of telephone net- 
works, electronic “brains,” and other circuits. To give some idea of the 
applications, we consider a very trivial example. The circuit in Fig. 10-15 
has the formula (a N δ) U (ec Na). But by the laws of Boolean algebra, 
this formula is equivalent to a M (b Uc), whose corresponding circuit is 
given in Fig. 10-16. But this second circuit involves less wiring and one 
fewer switches. In a similar way, very complicated circuits may often be 
replaced by simpler ones with very substantial savings. 


(q) Prove that in any Boolean algebra 
(17) aU (anb) = 


by giving reasons for the following; aU (aN ὃ) = (ane) ὦ (α Ω δ) = 
α΄ (ει Ub) =ane=a. (Ὁ) Draw diagrams of the two members in- 
terpreted as circuits. 


PROBLEMS 
1. Prove the following laws of Boolean algebra. 
(17) αὖ (α Ὁ δ) Ξ α, (17) aU (an δ) =a, 
(18) aN(a’Ub)=anb, (18) aU (a’nb)=aVUb,. 
(19) aR(aUd), (19) (aN δ) Ra, 
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(20) (aN δ) U (α' Nb) = (aUc)N (α' Ud), 
(21) (aNeU (ad Nb) ὦ (Nec) = (ane ὦ (and). 


2. Show that eg is the only element zx such that x R eo. 

3. Show that (aM ὃ) U (a Nb’) = (aU 8) N (α' ὦ δ). 

4. Draw the circuits corresponding to the two members of Problem 3. 

5. Argue that if a light is placed in either of the circuits of Problem 3 ap- 
propriately, it can be turned on or off from either of the switches a and b. 

6. State some other identities of Boolean algebra. 

*7. Show that the complement of any expression built up from primed and 
unprimed letters, cups, caps, and parentheses (but no primed parentheses) is 
obtained by interchanging cups and caps, priming each unprimed letter, and 
unpriming each primed letter. 

8. Show thatif AN X= AN YandAUX=AUY, then X = Y. 
9. Show that neither one of these hypotheses is sufficient for the conclusion. 
10. What are the undefined constants, formulas, and sentences in the presen- 
tation of Boolean algebra in this section? What formulas are defined? What 
are the variables? 
11. Could the number of undefined formulas be reduced? 
% 12. A farmer wants to be able to turn on or off his yard light from three dif- 
ferent locations. Design the circuit for the three switches. 
%*13. A man has to get a wolf, goat, and a basket of cabbages across a river by 
using a boat that will hold only himself and one of the three. He must not 
leave the wolf and goat, or the goat and cabbages, together out of his presence. 
How does he get them across? Do you see any connection with Boolean algebra? 
* 14. Argue that any law of Boolean algebra involving cup, cap, and prime 
only, remains true when cup and cap, and 60 and 61 are interchanged. 
* 15. Prove (22). 


(22) (aNb)U (and) U (a Nb) U (α' ND) = a1. 
% 16. Prove (23). 
(23) (a = 60) © [vb (an δγυ (a Nb) Ξε δ]. 


* 17. Show that (a Rb) «» (a’ Ub= 61). 

* 18. Show that every Boolean algebra has a subalgebra of Just two elements. 
(First you must define a subalgebra!) 

* 19. Show that the elements of a Boolean algebra form a group under * defined 
bya*b = (aNd) U (a’ Ω ὃ). 


ANSWERS TO EXERCISES 


(a) Each of the axioms appears in Section 3-7. (Ὁ) See Section 3-7 
(c) No, inverses do not always exist. (d) (8-8-16). (6) See Section 3-8 
(g) <. (h) aU (bNc) =at+be—abe. (aUb)N(aUec) = (α - ὃ — 
ab)(a + ς — ac) = a®-+ ac — a®e + αὖ + be — abc — αὖ — abc + arbc = 
a+ be — abe. (i) Again, the axioms appear in Chapter 2 as laws of logic. 
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Note that (9) becomes p V ἐς. Recalling that ¢ is a true proposition, this 
is equivalent to (2—-7-32). Similar interpretations have to be given to other 
identities involving ἐ and f. 

(j) —. There is a double use of — here, both as the relation of the Boolean 
algebra and as the implication sign for stating laws of the algebra as, for ex- 
ample, (14. (k) Each operation and relation on sets was defined in terms of 
the corresponding operation and relation on sentences. (1) They are just those 
of the Boolean algebra of {0,1}. (m) Obvious from (1). (n) Yes, since elec- 
trical switches being open and closed could corresponding to sentences being 
false or true, and a circuit would then correspond to a logical formula. (p) a Rb 
means that if a is closed, so is b. (4) (9’), (4), (9), (9’). 


ANSWERS TO PROBLEMS 


1. Just like corresponding proofs for set theory. 3. Left member = (aU a) 
(α κι. δ) Ο (δ) αἼ ἢ (ὁ ὦ δ) =aNa@UB)NbOUad)Ne = right mem- 
ber. 5. The circuit on the left is closed if either both switches are on or both are 
off. Hence any change at either switch changes the light. Same statement applies 
to right member. 9. Counterexamples: A = eg and A = e,. 11. Yes, just as 
we did in Chapter 2 by defining cup (or cap) in terms of cap (or cup) and prime. 
17. Prime both sides of the defining term in (11). 19. Verify each axiom. 


10-7 Axiomatics. In this chapter we have described three abstract 
mathematical theories. In each case the theory was constructed by speci- 
fying certain undefined constants variables, and formulas, then formulat- 
ing axioms, definitions, and theorems derived with the aid of logic and 
rules of proof. These three theories were natural abstractions from the- 
ories of a less abstract nature. It would appear that we could construct 
many such abstract theories by incorporating in an abstract theory cer- 
tain features of familiar theories. For example, the theory of groups takes 
the properties of closure, associativity, identity, and inverse. Could we not 
omit associativity and include commutativity instead in order to get a 
different theory? Yes, of course. Moreover, we are not required to use 
properties from familiar theories. Indeed we are free to choose our terms 
and axioms as we wish and even to specify our rules of proof and logical 
procedures at pleasure. This notion of freedom in constructing theories 
is one of the essential features of modern mathematics. 

Of course, the fact that we may construct theories as we wish does not 
mean that the theories we construct will inevitably be interesting, useful, 
or aesthetically appealing to others or even to ourselves. With the freedom 
of mathematical construction comes inevitably the necessity of a critical 
examination of the results. Since a theory is completely determined by 
its axioms, a critical examination of a theory naturally centers on its 
axioms. Axzomatics is the theory of axioms systems. 
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One of the first things we might ask about an axiom system is whether 
or not it is consistent. A set of axioms is called inconsistent if we can derive 
from it a contradiction, that is, some proposition of the form p A ~>. 
An axiom system is called consistent if this is not possible. If we discover 
a contradiction, inconsistency is proved, but our failure to discover one 
is not conclusive proof that none exists. The accepted method of proving 
consistency is to exhibit a set of things that satisfy the axioms, that is, to 
point to an example or realization of the theory. 

For example, we may establish the consistency of the group axioms by 
showing that the three cube roots of one form a group under ordinary 
multiplication. The idea behind this method of demonstrating consistency 
is the faith that the world is not inconsistent, and hence that if the axiom 
set is satisfied by some real objects, then the axioms must be consistent. 
However, when we show that a theory is realized in an example, we merely 
show that it is consistent with our theory of this example. For example, 
when we exhibit the integers under addition as an example of a group, we 
show that group theory is consistent if the theory of the integers under 
addition is consistent. Hence, such a proof of consistency is relative to 
some assumptions and has to go outside mathematics for its final justifi- 
cation. In practice, consistency means that the theory applies to some 
other theory that is accepted because of its connection with experience. 

(a) Give a definition of “example” as we have used this word above. (b) Show 
that the axioms of Boolean algebra are consistent if the theory of the arithmetic 
of integers is consistent. (c) Do you think the arithmetic is consistent? Why? 


Another question we might ask is whether a set of axioms is as simple 
as it might be. Could some be derived from the others? If so, the former 
could be omitted. It would be worth knowing whether some axioms could 
be proved as theorems, even if we preferred (as is often the case) to use 
an axiom system larger than absolutely necessary. As axiom system no 
one of whose members can be derived from the others is called independent. 

The independence of a particular axiom may be shown by finding an 
example for which the axiom is false and the other axioms are true. If 
this can be done, it is evident that the axiom could not be derived from 
the others, for such a derivation would amount to proving that the axiom 
must hold if the others do. 

(4) Cite an example of a set of numbers that satisfy the group axioms except 
for closure. (6) Cite an example in which all group axioms are satisfied except 
the existence of inverses. (f) Can you cite one in which only the existence of 
the identity fails? (g) Explain: An axiom system is independent if and only 
if every system obtained by replacing one axiom by its negation is a consistent 
system. 

There is a stronger kind of independence called complete independence. 
This is said to hold for a set of axioms if no subset of the axioms implies 
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the disjunction of the others. For example, if the axiom set is {p, q, 7, s}, 
then p A ᾳ does not imply r V 8, nor does any other conjunction of 
axioms imply the disjunction of the rest. This means that every axiom 
system made up by picking certain of the axioms and the negations of the 
remainder is a consistent system. For example, if there are three axioms 
{p, q,7r}, then complete independence means that {p,q,r}, “Ὁ, 9,7}, 


ἱρ, ἀκ r}, ἱρ, 4, αὙἹ, {~D, ~4q, ae {~D, q; ~r}, ἱρ, “(, ~T}, and {~D, 
~q, ~r} are all consistent systems. 


τος (ἢ) Show that the axiom system consisting of the commutative law, the asso- 
ciative law, and the closure law is completely independent. 


Another quality that may be possessed by an axiom system is com- 
pleteness. A system is said to be complete when it is impossible to introduce 
an additional axiom that is independent of the original axioms. In other 
words, any further true propositions must be consequences of the given 
axioms. Of course, by further axioms and propositions we mean those 
involving only terms of the given theory. 

Just as for the notion of consistency, there seems no way to establish 
completeness directly. To deal with consistency, we introduced the idea 
of the existence of an example, which we took as a sufficient condition for 
consistency. For completeness we use a similar strategy. An axiom sys- 
tem is said to be categorical if all its examples are isomorphic. Since any 
two theories that satisfy a set of categorical axioms are isomorphic, they 
differ only in the symbols used or in their interpretation. Every true sen- 
tence in one has a twin in the other. Two such theories are “essentially 
the same” in the same sense that they do not differ in any way as far as 
the theory is concerned. 

It is not hard to see that categoricalness implies completeness, for if an 
independent axiom could be added, the systems obtained by adding it and 
by adding its negation would both be consistent. It would therefore be 
possible to construct two examples satisfying the original axioms, one of 
which included the new axiom and the other its contradiction. These 
could certainly not be isomorphic without a contradiction. 


(i) The previous argument proves actually that incompleteness implies non- 
categoricalness. Why does this prove that categoricalness implies completeness? 
(1) Show that the group axioms are not categorical. (k) Show that the group 
axioms are not complete. (1) Do you think it is true that completeness im- 
plies categoricalness? (m) Show the completeness of the axiom system con- 
sisting of the group axioms and the following additional axiom: G contains 
exactly three members. 


The qualities mentioned above are certainly important ones, but there 
is no reason to think that they are the only important aspects of axiomatic 
systems. Nor should it be assumed that the named qualities are desirable 
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under all circumstances. If consistency is interpreted as absence of con- 
tradictions in the broadest sense, it is not a desirable feature of a theory 
that is intended to describe a situation involving contradictions! If con- 
sistency is interpreted as applicability, it may not be desirable for a 
theory that is intended as an object of amusement. An independent axiom 
system may be less useful in teaching a subject than a dependent system 
that is more appealing and easier to use. An incomplete system is more 
powerful than a complete one in the sense that it has a larger variety of 
applications. A categorical system is desirable only if we wish to construct 
a precise and elegant theory of a unique kind of structure. In short, the 
criteria discussed above are some of the important ones to apply to a 
theory in order to understand its nature and to see how it is adapted to 
the purposes that motivated its creation. 


PROBLEMS 


1. Show that the axioms of logic are consistent if elementary arithmetic 
is consistent. 

2. Show that the axioms of logic are neither categorical nor complete. 
Would it be desirable for them to be so? 

3. Show that any inconsistent set of axioms is complete. 

4, It may be objected that since basic terms are undefined and axioms un- 
proved, a mathematical theory gives no final proof or explanation of anything. 
Discuss this. 

5. Construct an abstract theory generalizing the notion of distance. 

6. Consider a field with the additional operation © defined by ao ὃ = 
a+ b6— ab. Is there an identity? inverse? associativity? Derive as many 
theorems as you can. 

7. Generalize the notion of an ordered pair to an ordered n-tuple (ordered 
set of m members). Include a definition of the cartesian product of n sets. 

8. Let F be a function satisfying the following axioms: 1. The domain of 
F is the n-factor cartesian product UX UX UX... Χ U, and its range is U, 
where U = {—1,0,1}. 2. If 2’ is obtained by permuting the elements of z, 
then F(z’) = F(x). 3. If 2’ is obtained from z by negating every element of z, 
then F(z’) = —F(z). 4. If F(x) > 0, and 2’ is obtained from z by increasing 
only one element of x, then F(z’) = 1. Prove that for a fixed n, these axioms 
are consistent, categorical, and completely independent. (For an interpretation 
of these axioms, see “A Set of Independent Necessary and Sufficient Conditions 
for Simple Majority Decision,” in Econometrica, October, 1952, and “A Note 
on the Complete Independence of the Conditions for Simple Majority Decision,” 
Econometrica, January, 1953.) 


ANSWERS TO EXERCISES 


(a) An interpretation satisfying the axioms. (0) The Boolean algebra 
of {0,1} is an example. (ὁ) Experience. (d) Integers between —3 and 3 
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under addition. (6) Reals under multiplication. (f) In the form in which 
we stated the axioms, the statement of the existence of an inverse depends on 
the existence of the identity. (g) Since we find examples to prove the inde- 
pendence. (h) Cite examples. (i) p> q=~q—7~qg. () We know of 
groups with different numbers of members. (k) We can add an axiom of com- 
mutativity. (m) Only one possible multiplication table. 


10-8 What is mathematics? Mathematics appears in our culture in 
so many varied ways that it is difficult to describe it as a whole. We wish 
here merely to suggest a tentative answer to the title of the section. 

The child first meets mathematics when he begins to count and to do 
simple problems in arithmetic. He thinks of mathematics as having to 
do with calculations which become more complicated as he advances in 
school. In high school he finds that geometry is part of mathematics, too, 
and at this stage he might be satisfied with the old definition that math- 
ematics is the science that deals with numbers and space. Indeed, this 
is a generally accepted view of mathematics. Certainly arithmetic and 
geometry are part of mathematics, but the reader of this book can see 
that they are merely excerpts from certain parts of mathematics, which 
is far too broad to be characterized as dealing with merely numbers and 
space. 

One of the characteristic features of mathematics is its special sym- 
bolism. This had led some people to say that mathematics 18 a language. 
Certainly mathematical language has some of the characteristics of every- 
day languages. For example, people can communicate mathematically 
almost without using the usual verbal languages. However, every math- 
ematical theory could be restated without the familiar special symbolism. 
The results would be most awkward and inconvenient, but the possi- 
bility shows that the symbolism of mathematics is not mathematics it- 
self. Indeed the view that mathematics is a language involves the con- 
fusion of symbols and their use. Mathematics, like any other discipline, 
has special terms and symbols, but mathematics is the subject that is 
expressed by these symbols and not the symbols themselves. 

Another feature of mathematics is its wide applicability. It is used by 
everyone, from the clerk making change in the store to the physicist 
setting up a differential equation. This has lead to mathematics being 
described as a tool. Of course, it is true that mathematical theories are 
useful. Many are developed in answer to specific scientific needs, and ex- 
perience suggests that even those theories that grow without any practial 
motivation eventually are found useful by someone. However, to define 
mathematics in terms of its applications would be to exclude much math- 
ematics that is developed without regard to applications and has not as 
yet found any. 
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The reader of this book has seen that mathematicians feel free to choose 
basic terms, axioms, and definitions at will. On the basis of any such 
choice, one can build a theory by the discovery and proof of theorems. 
Evidently mathematical work has a creative aspect. This has led to 
mathematics being described as a creative art. It is certainly true that the 
creative mathematician has much in common with the creative person in 
music and the visual arts. Both have a freedom to choose their forms and 
structures and both must accept the logical and aesthetic consequences of 
such choices. But this characterization of mathematics is also biased 
since it ignores the important link with applications. 

Perhaps one might give up the attempt to define mathematics in any 
simple way and simply say that it is what mathematicians do and that 
mathematicians are those people in our society with certain kinds of 
education and professional memberships. Such a definition could hardly 
be wrong, but it is not satisfying if one wants to understand what it is 
that mathematicians do! 

Let us look at mathematics as seen in this book and in the reader’s 
previous studies. It appears as a collection of abstract theories. These 
theories differ from the abstract theories of other sciences (such as 
those of physics, biology, or economics) by being on a higher level of ab- 
straction, that is, they do not include any concrete interpretation but are 
capable of many different interpretations. When mathematical theories 
are expressed formally in terms of axioms and theorems, this abstractness 
is reflected by the fact that the undefined terms remain ambiguous in 
their meaning. Some mathematical theories have many applications out- 
side of mathematics (for example, the theory of growth and decay) and 
other theories apply to various theories within mathematics itself (for 
example, group theory). Theories of the latter sort are considered to be 
more abstract, and some mathematicians use the term “abstract math- 
ematics” only in connection with them. We might be tempted, then, to 
define mathematics as a collection of abstract theories, but there are cer- 
tainly many abstract theories that are not mathematical. 

A common characteristic of the mathematical theories familiar to the 
reader is their logical character, their organization in terms of axioms, defi- 
nitions, and proofs. Of course, this feature is common to theories outside 
mathematics also, for example, in theology and physics. 

The above discussion suggests that it may not be possible to give a 
definition of mathematics in terms of any single one of its characteristics. 
However, 1f we include several of them we may find one that is neither too 
restrictive nor too inclusive. Here is such an attempt: Mathematics is the 
collection of known abstract, axiomatic, consistent theories. That is, 
mathematics is the set to which an object belongs if and only if it is a 
theory that is abstract (in the sense of having multiple interpretations), 
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axiomatic (logically organized), and consistent (having at least one appli- 
cation or example, as indicated in Section 10-7). This characterization 
seems to fit the criteria used by mathematicians. An alleged contribution 
to mathematics is rejected as mathematics if it is limited to a particular 
application outside mathematics (not sufficiently abstract), fails to be, or 
to be a part of, an axiomatic theory, or fails to have any realizations 
(within or outside of mathematics). 

Other aspects of mathematics, such as its special symbolism, its aesthetic 
values, the creative activity associated with it, its numerous inter-relations 
with other disciplines, and its role in various other aspects of our culture 
are not part of mathematics itself but rather related to its creation and 
use by human beings. 


FURTHER READING 


In place of a set of problems, we follow this section by a list of books in which 
the reader will find stimulating ideas on the nature of mathematics and further 
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TABLE I 


Roots AND RECIPROCALS 


Ζ 41,32 413 1000/z Ζ «1:2 gills 1000/zx 
1 1.000 1.000 1000.00 36 6.000 3.302 27.76 
2 1.414 1.260 500. 00 37 6.083 3.3082 27.03 
3 1.732 1.442 333.33 38 6.164 3.362 26.32 
4 2.000 1.587 250.00 39 6.245 3.391 25.64 
ὃ 2.280 1.710 200.00 40 6.325 3.420 25.00 
6 2.449 1.817 166.67 41 6.403 3.448 24 89 
7 2.646 1.913 142.86 42 6.481 3.476 23.81 
8 2.828 2.000 125.00 43 6.557 3.503 23.26 
9 3.000 2.080 111.11 44 6.633 3.530 22.73 
10 3.162 2.154 100.00 45 6.708 3.557 22.22 
11 3.817 2.224 90.91 46 6.782 3.583 21.74 
12 3.464 2.289 83.33 47 6.856 3.609 21.28 
13 3.606 2.351 76.92 48 6. 928 3.634 20.83 
14 3.742 2.410 71.438 49 7.000 3.659 20.41 
15 3.873 2.466 66.67 50 7.071 3.684 20.00 
16 4.000 2.520 62.50 51 7.141 3.708 19.61 
17 4.128 2.571 58.82 52 7.211 3.733 19.23 
18 4.243 2.621 55.56 53 7.280 3.756 18.87 
19 4.359 2.668 52.63 54. 7.348 3.780 18.52 
20 4.472 2.714 50.00 55 7.416 3.803 18.18 
21 4.583 2.759 47.62 56 7.483 3.826 17.86 
22 4.690 2.802 45.45 δ 7.550 3.849 17.54 
23 4.796 2.844 43.48 58 7.616 3.871 17.24 
24 4.899 2.884 41.67 59 7.681 3.893 16.95 
25 | 5.000 | 2.924 | 40.00] 60 | 7.746 | -3.915 | 16.67 
26 5.099 2.962 38.46 61 7.810 3.936 16.39 
27 5.196 3.000 37.04 62 7.874 3.958 16.13 
28 5.290 3.037 35.71 63 7.9387 3.979 15.87 
29 5.385 3.072 34.48 64 8.000 4.000 15.62 
30 5.477 3.107 33.33 65 8.062 4.021] 15.38 
31 5. 568 3.141 32.26 66 8.124 4.041 15.15 
32 5.657 3.175 31.285 67 8.185 4.062 14.93 
33 5.745 3.208 30.30 68 8.246 4.082 14.71 
34 5.831 3.240 29.41 69 8.307 4.102 14.49 
35 5.916 3.271 28. 57 70 8.367 4.121 14.29 
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TaBLeE I (Continued) 


Roots AND RECIPROCALS 


71 8.426 4.141 14.08 86 9.274 4.414 11.63 
72 8.485 4.160 13.89 87 9.327 4.431 11.49 
73 8.544 4.179 13.70 88 9.381 4.448 11.36 
74 8.602 4.198 13.51 89 9,434 4.465 11.20 
75 8.660 4.217 13.33 90 9.487 4.481 11.11 
76 8.718 4.236 13.16 91 9.539 4.498 10.99 
71 8.775 4.254 12.99 92 9.592 4.514 10.87 
78 8.832 4.273 12.82 93 9.644 4.531 10.75 
79 8.888 4.291 12.66 94 9.695 4.547 10.64 
80 8.944 4.309 12.50 95 9.747 4.563 10.53 
81 9.000 4.327 12.35 96 9.798 4.579 10.42 
82 9.055 4.344 12.20 97 9.849 4.595 10.31 
83 9.110 4.362 12.05 98 9.899 4.610 10.20 
84 9.165 4.380 11.90 99 9.950 4.626 10.10 
85 9.220 3.397 11.76 |} 100 | 10.000 4.642 10.00 
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TABLE II 
TRIGONOMETRIC FUNCTIONS 


Angle 


Cosine Tangent | Cotangent 
Radians Degrees 


Soqoqcco Ss. oo oc So 
oe SoS co oo 
Seoo ac: ce oc Sc 


ΞΘ 2 GS 
OS OS oS Oo 
So OOo: 
ἘΠ oo 


0 0 0. 1 1 
0 0 0. 1 0 
0 0 0. 1 0 
0 0 0. 1 0 
0 0 0. 1 0 


a μα 
Ooo OO © Ὁ 
Nb be τσν 
ΘΘΩΟΩ͂ Φ oS 


μι μι μ μ᾿ ee 
oc oO & 
So Oo Oo 


— 
μ- 
Θ 


* Undefined. 
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TABLE III 


CoMMON LOGARITHMS 
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TABLE IV 


NATURAL LOGARITHMS 


a OOO © QoQ oo oo 
eo o a © aon oa a 
μιι ee μ 5 
aww 2 
Coo © 
Cc μ ὦ 


rere Oo ΟΦ ΘΟ ΘΘ. ΘΟΘΟΘΦς φ ππΠπΠ᾽.9Θ 


1 
2 
3 
4 
ὃ 
6 
7 
ὃ 
9 
0 
1 
2 
3 
4 
ὃ 


CONNNN NONNNND NORSK Ee 
CO OND ARP WNRK ς ὦ OAS 


oS rR WL ἡ ὦ Non μα = = 


| 


Con nok ὧν μα 


RANDOM SEQUENCE 
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TABLE V 


1100100111 
0111111101 
1001010011 
1001011000 
1101100101 
0001100001 
0011110011 
1110010011 
1001010110 
0010100001 


1111101010 
0100111011 
0000110011 
0001010001 
1110100100 
0011101000 
1011100100 
0000110111 
1000110010 
1110110110 


1111100010 
1110101110 
0001111111 
1000110011 
0101011101 
0110100010 
0001111010 
0110010011 
0101011100 
0011011010 


1100000100 
0001001010 
1010000101 
0011010000 
1010010001 
1011010111 
0101000101 
1101100001 
1011001110 
0010111000 


0100011001 
1110011001 
1011001100 
1001111110 
1101111000 
1001100100 
0010111000 
1100110001 
1110001100 
1011000100 


0101000100 
1000001110 
1110110101 
1101000001 
0111110000 
0110100011 
0010000110 
1000101000 
0100000111 
1001010001 


0011010010 
1100001110 
1010101110 
0101010110 
0100101011 
0010100000 
0011100000 
1010101000 
1001111011 
0011011110 


1110011011 
0111011100 
0000010010 
1110101110 
0100101100 
1010001000 
1100100101 
0111100001 
0000111101 
0100101101 


0111011100 
0000110001 
1010001001. 
1000011110 
1011111100 
0001110000 
0101001011 
0110100111 
1000111110 
0110110110 


1100000101 
0001111100 
1001001011 
1010001000 
1111010001 
1111011100 
1110100010 
1001010000 
0110010101 
1001001000 


p= 1/2 


1011111100 
1111110001 
1100011000 
1100011111 
1111001110 
1011100101 
1001001010 
1100011110 
1000101101 
1111001110 


1000000101 
1101011100 
0111010101 
1100110010 
0000000111 
0111000011 
0011101011 
0111001110 
0010011101 
0100011110 


0100010100 
1001011100 
1111001101 
0110101000 
1001101001 
0110111100 
1110100110 
0001000011 
1010001000 
1100100101 


0011011000 
0011010100 
1110101101 
0001110010 
1101001010 
0110010100 
0100001010 
1110011110 
1001101111 
0011111000 


0010101101 
0011000111 
0010010100 
0000111101 
1101101001 
1000101001 
1111111100 
1111010001 
1101101001 
0010011111 


0110000011 
0110101010 
1141000111 
0101000001 
1101001001 
0101000101 
0101001000 
1001001010 
1010000110 
1011000011 


1001110101 
1011101101 
1010110001 
0000100110 
1001001001 
1000011100 
1011010110 
0011000101 
1110010101 
1011010110 


0011100110 
1010100001 
0101000100 
1101011101 
1100100101 
1000001111 
0001111111 
0100000111 
0011010000 
1011010000 


1010011100 
1001000111 
0000110000 
0010000111 
0000111001 
0011010111 
0011100101 
1011110010 
0101111110 
1000000001 


1011111001 
0010110110 
1000111001 
1000100001 
1110111110 
LIOLOLOLIL 
0100111100 
1101100111 
1010110001 
0000010001 


0111101100 


0011010110 
1001111111 
0111110011 
0111000100 
0010011001 
1110010000 
0011011011 
0110101110 
1000000100 


1111011011 
1110011101 
1011010010 
0011100111 
1011101011 
0111111100 
1000011001 
1110100011 
0101000010 
0010110111 


0111001111 
0100100101 
1100010010 
1000110100 
0100101001 
0001011000 
0000000111 
1000100110 
1111110000 
0011101000 


0010111101 
1100010010 
0010111011 
1011000000 
0010110000 
0101001000 
0010111101 
1011010110 
1000011110 
1101111011 


1111110111 
0101100111 
0001011000 
0010111101 
1111101011 
0110001000 


0111100100 


0011111011 
0100000111 
1111000110 
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TABLE VI 
NORMAL DISTRIBUTION 


f(2) = (Ὡπ)- 126-82 Ai) = fis 


F(z) H(z) | f(2) H(z) 


0.0 
0.1. 
0.2 
0.3 
0.4 
0.5 
0.6 
0.7 
0.8 
0.9 
1.0 
1.1 
Lee 
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TABLE OF SPECIAL SYMBOLS 


After each symbol is given a brief indication of meaning and references 
— to first use and definition. 


ARITHMETIC AND ALGEBRA 


— operation of negation, 1-2; operation of subtraction, 1-4. 
1/a_ reciprocal of a, 1-2. 
+. operation of addition, 1-4, 6-3, 6-8, 6-9, 6-13. 
operation of multiplication, 1-5, 6-3, 6-8, 6-9, 6-13. 
(a:b = ab) 
+ operation of division, 1-4. (a + ὃ = a/b) 
a*,a°,... powers of a, 1-5, I-18. 
a” nth power of a, 6-6, 6-8, 6-9. 
ja| absolute value of a, 1-4, 1-13, 3-9. 
fa non-negative square root of a, 1-13. 
Wa _ real cube root of a, 5-5. ; 
principal fourth root of a, 5-6. 
less than, 1-7, 3-5. 
greater than, 1-7, 3-5. 
less than or equal to, 2-2, 3-5. 
greater than or equal to, 2-2, 3-5. 


SIVIAVA S$ 


percent, 1-13. 
a<a«<b a less than ὦ and z less than 0, 3-4. 
(α δ), (a δ), (a __b), (a2 ὃ) intervals, 3-4. 
3.214 repeating decimal, 3.214141414..., 1-15. 
N natural numbers {1, 2, 3, ...}, 6-2. 
G the digits, 5-2. | Ἢ 
J integers, 3-2. 
J+ positive integers, 3-2, 6-8. 
ΗΕ even numbers, 3-2. 
D_ odd numbers, 3-2. 
δα rational numbers, 3-2. 
Ra* positive rationals, 6-8. 
Re real numbers, 3-5. 
Ret positive reals, 3-5. 
Ρ prime numbers, 3-2. 
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{a,b,...} 
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a divides b, 5-3. 

largest member of S, 3-9. 

smallest member of δ, 3-9. 

cardinal number of the set S, 3-4, 6-2. 

aleph null, 6-2. 

the successor of x, 6-4. 

n-factorial, 6-6. 

summation, 6-7. 

continued product, 6-7. 

complex number, 6-18. [(a,b) = a -+ 7b] 
complex number in polar form, 4-10, 6-13. 
complex conjugate, 6-13. 

(0,1), 6-13. 

base of natural logarithms, 7-14. (e = 2.71828). 
ratio of circumference to diameter of a circle, 1-2. 
approximately equal to, 1-4, 6-11. 


LOGIC AND SETS 


relation of identity, 1-6. 
relation of nonidentity, 1-6. | 
not, logical negation, 2-2. 

or (= and/or), 2-2. 

and, 2-2. 

or else, 2-2. 

imphies, 2-3. 

if and only if, 2-3. 

therefore, 2-6. 

for all x, 2-10. 

for some x, 2-10. 

for one and only one 2, 2-11. 
a-sub-z, 1-3. 

substitution, 1-11. 

is ἃ member of, 3-2. 

is not a member of, 3-2. 

set whose members are a, b, ... , 3-2. 
set of x’s such that... , 3-2. 


(x, y) 

((x, | f(x, y)} 
AxB 

u 

0 

I(x, y)|, [ul 

(7, 8) p 

6 

m 

AB 


f(z) 
Say) 
Dom (p) 
Rge (p) 
Fun 

I 

Ta 


sin, cos, tan, cot, sec, esc 
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empty set, 3-3. 


universal set, 3-3. 

is a subset of, 3-4. 

power set of A, 3-4. 
complement of A, 3-6. 
intersection of A and B, 3-6. 
union of A and B, 3-6. 
exclusive union of A and B, 3-6. 
relative complement of B, 3-6. 
A and B are disjoint, 3-8. 

is a proper subset of, 3-8. 


overlaps, 3-8. 


the x such that... , 3-9. 
the set A is similar to the set B, 6-2. 


ANALYTIC GEOMETRY 


ordered pair, vector, point, 4-2, 4-3, 6-13. 
the set of ordered pairs (x,y) such that f(x,y), 4-2. 
cartesian product, 5-2. 


vector, 4-3. 


null vector (0, 0), 4-4. 

length of vector, 4-4. 

point with polar coordinates r and 6, 4-10, 6-13. 
eccentricity of a conic, 5-13 through 5-17. 

slope of straight line, 4-5. 

distance between A and B, 4+. 


FUNCTIONS AND RELATIONS 


f-of-x, image of x under f, 2-9, 5-4. 
f-of-x-and-y, 2-9. 
domain of p, 5-3. 


range of p, 5-3. 


the set of functions, 5-4. 
identity function on the reals, 5-7. 
identity function on A, 5-9. 


circular functions, 4-11, 4-12. 
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mI -+ 6 linear function, 5-7. 
δ, constant function, 5-7. 
|Z| the norm, 5~7. 
sg signum function, 5-7. | 
[x] the largest integer not greater than x, 5-7. 
al* + bf +c quadratic function, 5-8. — : 
peo composite of p anda, 5-9. 
16] composite of G and F (for functions only), 5-9. 
ρ΄ converse of p, 5-10. 
b’ exponential function, 6-6, 7-13. 
e exponential function to the base e (= exp 7), 7-14. 
exp, exponential function to the base b, 7-13. 
log, logarithmic function to the base b, 7-13. 
In log,., 7-14. 
Yn nth term of a sequence, 6-6. 


LIMITS AND CALCULUS 


Df derivative of f, 7-2. 
x πα x approaches a, 7-2, 7-38. 
lim f(z) the limit of f(x) as x — a, 7-3. 


7 (Ω) = Df(x) = df(x)/dx = Ὁ, f(x) = f(x), 7-4. 
ὉΠ ἢ antiderivative, 7-9. 
An, An upper and lower sums, 7-10. 
T,, Zp Upper and lower sums, 7-11. 
1,1 upper and lower integrals, 7-11. 


b . 
f definite integral of f from a to 8, (= ἢ f(x) az), 7-11. 


f definite integral from minus to plus infinity of f, 8-7. 


dx, dy, df(«) differential, 7-17. 
Ax increment in x, 7-4, 7-17. 


PROBABILITY AND STATISTICS 
P(X] probability of X, 8-1. 
C” combinations of n things r αὖ ὃ time, (= (2) , 84. 


Cr eke 

1’ 12 k 

ΡΙΑΙΒῚ 

Elg(x)] 
μ 

σ 

θ 

6 

ok 

U, A, ” 

61, €0 
A a 
B β 
Γ Y 
A ὃ 
E € 
Z ζ 
Η η 
Θ θ 
I l 
K K 
A λ 
Μ μ 
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number of ordered partitions, 8-4. 


probability of A given B, 8-5. 
expectation of g(x), 8-6. 
mean, 8-6, 8-7. 

standard deviation, 8-6, 8-7. 
estimator of 6, 9-3. 


ABSTRACT THEORIES 


identity element, 10-2. 

unspecified binary operations, 10-1. 
operations of Boolean algebra, 10-6. 

special elements of a Boolean algebra, 10-6. 


GREEK ALPHABET 


Alpha N ν Nu 
Beta = ἕξ Χι 
Gamma O O Omicron 
Delta II v Pi 
Epsilon P p Rho 
Zeta > σ sigma 
Eta T T Tau 
Theta T υ Upsilon 
Iota ® φ Phi 
Kappa xX i Chi 
Lambda Ψ Ψψ Psi 

Mu Q ω Omega 


INDEX 


Absolute maximum, 411 
Absolute probability, 502 
Absolute value, 11, 52, 53, 165, 269 
Abstract theories, 548 ff. 
Acceleration, 388, 462 
of gravity, 428 
Accuracy, 361, 366 
Addition, 10, 27 ff., 8327, 332, 348 
of ordinates, 295 
Adjacent members of a set, 353 
Admissible hypothesis, 533 
Affine transformation, 567 
Aleph null, 324 
Algebra 
elementary, 1 ff., 25, 27 ff., 41 ff., 65, 
94, 104 ff., 146 ff., 321 ff. 
of sets, 155 ff. 
All, 117, 121, 169 
Amplitude, 290 
Analytic geometry 
on a line, 2 ff., 22-23, 148 ff. 
in the plane, 174 ff., 245 
And, 71 
Angle, 214 ff., 238 ff. 
between two lines, 238, 240 
Angular velocity, 282 
Antiderivative, 429, 479, 559 ff. 
Approaches, 387, 390 
Approximation, 14, 360, 365, 368, 377, 
405, 4386, 471 
Are, 214 ff. 
Arc length, 217, 468, 477 
Area, 377, 434, 443 
Area of a circular sector, 218 
Arithmetic, 10 ff., 27 ff., 44-46, 298, 
327, 357, 360, 865 
Arithmetic mean, 150 
Arithmetic operations on functions, 294 
Arithmetic progression, 339, 346 
Associative law 
of addition, 28, 65 
of multiplication, 29, 65 
Associativity, 155, 281, 555 
Asymptote of a hyperbola, 311 
Average, 189, 539 
Average rate of change, 379 


Average velocity, 378 
Axial symmetry, 310 
Axiomatics, 577 ff. 
Axioms, 64, 321, 571 
for the natural numbers, 330 
of probability, 488 
for the real numbers, 64 ff. 
Axis, 2, 175, 306 
of symmetry, 277, 307 


Bar, 34 

Base, 357, 453 

Basic assumption, 64 

Basic term, 47 

Belong, 134 

Biconditional, 77 

Binary operation, 261, 549, 550, 572, 
573 

Binary relation, 248 

Binary system, 358 

Binomial coefficient, 500 

Binomial distribution, 509 

Binomial theorem, 419, 500 

Bisector of a segment, 188 

Boolean algebra, 572 ff. 

Border, 561 

Bounded set, 355 

Brace, 34, 135 

Brackets, 34 

Break-even chart, 203 

Buffon’s needle problem, 516 


Calculus, 377 ff. 
Calculation, 365 ff., 455, 462 
Cancellation, 31, 32, 569 
Cap, 152 
Cardinal number, 321 ff. 
Cartesian graph of a sentence, 246 
Cartesian plane, 245 
Cartesian product, 244, 328 
Categorical theory, 579 
Causality, 261 
Center 

of ellipse, 306 

of the hyperbola, 310 

of symmetry, 310 
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Central limit theorems, 535 

Central symmetry, 310 

Certain event, 489 

Chain rule, 417 ff. 

Chances, 495 

Characteristic of logarithm, 455 

Chebyshev’s inequality, 522 ᾿ 

Chess, 494 

Circle, 188, 209 ff., 266, 302 

Circuits, 573 

Circular function, 289, 396, 426, 482 

Circular motion, 282 

Circularity, 46, 64-65 

Class, 322 

Classical logic, 169 

Classification, 497, 500 

Closed interval, 145 

Closure, 261, 337, 348, 374, 376, 550, 
555 

Cocfhicient, 403 

Column, 202 

Combination, 497 

Combinatorial analysis, 492 

Common difference, 339 

Common logarithm, 455 

Common ratio, 341 

Commutative group, 555 

Commutative law 

of addition, 28, 65 
of multiplication, 29, 65 

Commutativity, 125, 155, 281 

Complement, 576 

Complement of a set, 151 

Complementary relation, 265 

Complete independence of axioms, 
578 

Complete theory, 579 

Completing the square, 211, 275, 314 

Complex conjugate, 376 

Complex number, 5, 369 ff., 378, 552, 
553 

Component, 181, 182, 241 

Composite, 280 ff., 298 

Compound event, 502 . 

Computation, 365 ff., 455, 462 

Concave downward, 413 

Conclusions, 77. ὁ 

Concurrent lines, 204 

Conditional, 76 

Conditional probability, 501 

Cone, 300 

Confidence coefficient, 542 


INDEX 


Confidence interval, 542 
Congruence 

(geometry), 98, 217 

mod m, 5538 
Conic section, 300, 566 
Conjugate, 376. 
Conjunction, 71 ff. 
Consistent equations, 206 
Consistent theory, 578 
Constant, 6 ff., 94 
Constant function, 268 
Constant of proportionality, 271, 426 
Constructive proof, 119 
Context, 6, 16, 25, 50 
Continued product of notation, 347 
Continuous distribution, 512 
Continuous function, 394, 421, 448 
Continuous set, 356 
Continuum, 356 
Contradiction, 83, 99 
Contraposition, 87 
Converse, 77 
Converse relation, 284 
Coordinates, 2, 175 ff. 
Correspondence, 63, 260 
Cosine, 223 ff., 258, 290 
Cosines, law of, 239 
Cost, 203, 276 
Countable set, 326 
Counterexample, 104 
Counting, 322, 325 
Critical point, 410 
Critical region, 533 
Cross classification, 500 
Cube, 48 
Cube root, 259 
Cube roots of 1, 559 
Cubic, 403 
Cumulative distribution function, 507 
Cup, 152 
Cut, 354 
Cyclic group, 571 
Cycloid, 467 


Decay, 459 

Decimals, 4, 59 ff., 257, 354 ff. 
Decision function, 544 
Defined term, 48, 50 

Defining formula, 256, 280 
Defining sentence, 136, 255 
Defining term, 48, 51 

Definite integral, 439 ff. 


INDEX 


Definition, 46 ff., 67, 249, 254, 333, 438, 
581 ff. 

Definition form, 249, 254 

Degenerate conic, 302, 317 

Degree, 297, 403 

Degrees, 218 

Deleted neighborhood, 392 

Demand, 199 

DeMoivre’s theorem, 374 

De Morgan’s laws, 155 

Denominator, 34 

Dense sets, 358, 355 

Density function, 514 

Denumerable set, 324-325 

Dependent equation, 206 

Dependent variable, 256 

Derivative, 383, 387, 398, 559 
of composite, 417 
of inverse, 424 

Description, 164 ff. 

Descriptive statistics, 529 

Determination, 254, 259 

Deviation, 414 

Dichotomy, 497 

Difference (see also Subtraction), 12 

Differentiable, 402 

Differential, 471 

Differential equation, 428 ff., 462 

Digit, 59 

Direct variation, 268 ff., 271 

Directed angle, 217, 238 

Directed arc, 215 

Direction angle, 220, 222, 258 

Direction on an axis, 2 

Directrix, 302, 307, 310, 313 

Discontinuity, 394 

Discrete distribution, 507 

Discrete set, 353 

Discriminant, 108, 277 

Disjoint event, 488 

Disjoint relation, 265 

Disjoint set, 161 

Disjunction, 72 ff. 

Distance, 11, 165, 186 ff. 
directed, 9, 12, 183 
between a point and line, 208 
between two points, 188 
undirected, 165 

Distribution function, 507 

Distribution problem, 212 

Distributive law of multiplication, 

41 ff., 65, 155 
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Divisibility, 249 

Division, 13, 29, 55 ff., 67, 328, 552 
of a segment, 189 
by zero, 55 

Divisor, 249 

Domain, 136, 250 

Dot product, 240 

Double-angle formulas, 236 

Dummy, 126, 136 

Duodecimal system, 359 


Eccentricity of a conic, 302, 307, 310, 
313 
Electromotive force, 461 
Klementary functions, 518 
Elimination, 103 
Ellipse, 300, 305 ff., 317, 560 
Empty set, 138 
Equality, 18 
Equally likely, 491 
Equations, 18, 104 ff., 206 
conditional, 19 
simultaneous, 199 
Equivalence, logical, 78, 101, 106, 1389 
Equivalence relation, 552, 572, 574 
Error, 361 
Errors of Type I and II, 533 
Estimation, 367, 538 
Estimator, 538 
Euclidean geometry, 301, 565 
Euclidean group, 565 
Even function, 408 
Even number, 45, 138, 351, 559 
Event, 486 
Evidence, 132 
Exact rate of change, 379, 381, 398 
Exact velocity, 378 
Excluded middle, law of, 99 
Exclusive disjunction, 72 ff. 
Exclusive union, 154 
Exhaustive classification, 497 
Existence theorem, 119 
Existential quantifier, 118 
Expanding, 43 
Experimental design, 535 
Explicit definition of a function, 255 
Exponent, 340, 352, 356, 571 
Exponential function, 341, 453, 459, 
559 
Expression, 6 
Extension of the number system, 348 ff. 
Extremum, 410 
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Factor, 12, 50, 249 
Factorial, 341 
Factoring, 43 
Factor theorem, 407 
Fallacies, 96, 142 


Familial relations, 251, 266 ff., 280 ff., 
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Fechner’s law, 461 
Fibonacci sequence, 342 
Field, 550 ff. 

of integers mod m, 553 
Finite cardinal, 324 
Finite field, 551 
Finite induction, 334 ff. 
Finite set, 324 
First moment, 509 
First term, 339 
Focus of a conic, 302, 307, 310, 313 
Force, 229 
Form, 17 
Formal definition, 48 
Formal proof, 93 
Formulas, 16 
Fractions, 3, 34, 55 ff., 67 
Freely falling body, 388, 428, 433 
Frequency, 290, 491 
Frequency function, 507 


Function (see also Linear function, 


Even function, etc.), 254 ff., 
563 
Functional notation, 113 ff., 254 ff. 


Fundamental theorem of integral cal- 


culus, 446 


Game of strategy, 258 
Genetics, 500, 505 | 


Geometric interpretation, 2 ff., 10 ff., 


173 ff., 563 
Geometric mean, 150 
Geometric progression, 347 
Geometries, 565 
Graph, 2, 144, 177, 246, 249, 256 
Greater than, 22 
Greatest lower bound, 355, 438 
Group, 554 ff. 
Growth, 459 


Half-life, 461 

Heuristic, 93, 128 ff., 337 
Hyperbola, 300, 308 ff., 317, 560 
Hypergeometric distribution, 511 
Hypocycloid, 467 


INDEX 


Hypothesis, 64, 77 
rule of, 94 
statistical, 532 


Idempotent laws, 155 
Identities, 19, 28 ff., 41 ff., 55 ff. 
logical, 85 ff. 
trigonometric, 231 ff. 
Identity, 18 ff., 85, 142, 281 
Identity element, 67, 550, 555 
Identity function, 268, 297 
Idioms, 261 
If, 74 
If and only if, 78, 302 
Image, 250, 254, 258, 260 
Imaginary ellipse, 317 
Imaginary number, 5, 373 
Imaginary part, 373 
Imaginary unit vector, 372 
Immediate predecessor, 353 
Immediate successor, 353 
Implication, 74 ff., 98 ff. 


Implicit definition of a function, 255, 


423 
Impossible event, 486 
Inclination, 238 
Inclusion, 148, 170 
Inclusive disjunction, 72 ff. 
Inconsistent equations, 206 
Inconsistent theory, 578 
Increment, 379, 401, 478 
Indefinite integral, 431 
Independent axioms, 578 
Independent equations, 206 
Independent events, 503 
Independent variable, 256 
Indirect argument, 102, 351 
Induction, 334, 336, 420 
Induction hypothesis, 336 


Inequalities, 22, 72, 144 ff., 197, 3338 


Inference, rule of, 92 
Inferential statistics, 529 
Infinite decimal, 59 ff. 
Infinite set, 324 

Infinity, 145, 880, 442 
Inflection, 415 

Informal definition, 48 
Informal proof, 93 
Initial point, 9 

Initial side of angle, 217 


Instantaneous rate of interest, 461 


Instantaneous velocity, 378 


INDEX 


Integers, 1 ff., 187, 350 
Integral, 478 
Integral domain, 553 
Integrand, 446 
Intercept, 195, 258 
Intercept form, 197 
Intercepted arc, 217 
Interest, 342, 389, 421, 461 
Interpolation, 227 ff., 456 ff. 
Intersection, 199 

of relations, 265 

of sets, 151 
Interval, 144, 156 
Interval estimation, 542 
Invalid argument, 96 
Invariance, 565 
Invariants of a conic, 320 
Inverse, 296, 348, 550, 555 
Inverse circular function, 291 
Inverse element of group, 569 
Inverse function, 285 
Inverse variation, 426 
Inversely proportional, 426 
Involution laws, 155 
Irrationality of the square root of 2, 351 
Irrational number, 4, 61 ff., 351 ff. 
Irrational point, 356 
Isomorphic group, 558 
Isomorphism, 349, 370, 571 


Just, 125 


Lagrange’s theorem, 570 
Large numbers, laws of, 522 ff. 
Law, 24 ff., 77, 119 

of cosines, 239 

of large numbers, 521 

of mean, 480 

of sines, 239 
Least upper bound, 355, 438 
Lemma, 99 
Length, 378 

of an are, 217, 468, 477 

of a vector, 11, 52, 186 
Less than, see Inequalities 
Level of significance, 533 
Likelihood, 543 
Limit, 390 ff., 442, 514 

of integration, 446 

of summation, 344 
Limiting case, 303 
Linear function, 268 ff., 282 
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Linear equation, 107, 193 

Linear graph, 144 

Linear (homogeneous) transformation, 
567 

Linear interpolation, 227, 456 

Linear transformation, 567 

Linear velocity, 282 

Local maximum, 411 

Locus, 188 

Logarithm, 453 ff. 

Logic, 70 ff., 252, 337, 559, 573 

Logical relation, 267 

Logically equivalent, 78, 106, 139 

Logistic law, 462 

Lower bound, 355 

Lower limit of integration, 446 

Lower sum, 440 


Malthus, 342 
Major axis of ellipse, 306 
Mantissa, 455 
Many-many correspondence, 260 
Many-one correspondence, 260 
Mapping, 260, 285 
Marginal cost, 399, 414: 
Marginal revenue, 414 
Marginal unit cost, 427 
Marginal utility, 414 
Material implication, 76 
Mathematical statistics, 530 
Matrix, 202 
Maxima and minima, 164, 276, 409 ff. 
Maximization principle, 546 
Maximum likelihood estimate, 543 
Maximum of a function, 278 
Mean, 150, 414, 509 
law of, 430 
Mean deviation, 510 
Measure, 434, 488 
of central tendency, 414 
Median, 539 
Member, 134 

of equation, 18 

of inequality, 22 
Metatheory, 96 
Midpoint of a segment, 188 
Mil, 218 
Minimum, 164, 276 

of a function, 278 
Minor axis of ellipse, 306 
Minus sign (see also Negative), 30, 

56 ff. 
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Mode, 539 
Moment, 509 
Most probable value, 511 
Multinomial coefficient, 500 
Multinomial theorem, 500 
Multiple, 249 
Multiplication, 12, 29, 44, 45, 247, 
332 

of cardinals, 328 

of rationals, 348 

of vectors, 369 ff. 
Multi-valued function, 261 
Mutually exclusive classification, 497 
Mutually independent, 504 


Names, 6 

Natural logarithm, 459 

Natural number, 324 

Necessary, 75, 108 

Necessary and sufficient, 78 

Negation, 2, 11, 257 

logieal, 71, 122 

Negative, 2, 65, 182 

Neighborhood, 392 

Nested intervals, 354 

Newton’s method, 405 

Nonconstructive proof, 120 

None, 121, 169 

Nonfunctional relation, 261 

Norma! distribution, 517, 540 

Norm function, 269 

Noun, 134 

nth roots of 1, 560, 571 

nth term, 339 

Null relation, 264 

Null set, 139 

Null vector, 182 

Number (see also Complex number, 
Even number, Integer, Natural 
number, Odd number, Prime num- 
ber, Positive integer, Rational 
number, Real number), 1 ff., 321 

Numeral, 6 

Numeration, 357 

Numerator, 34 


Odd function, 408 

Odd number, 45, 138, 352, 559 
Odd power function, 404 
Odds, 495 

One, 30, 65, 324, 330 

One and only one, 125 


INDEX 


One-to-one correspondence, 63, 1738, 
181, 216, 262, 322, 349, 563 

Only, 75 

Only one, 125 

Open interval, 145 

Operation, 67, 260, 555 

Operational definition, 51 

Operations of arithmetic, 10, 33 

Or, 72 ff. 

Orbits of planets, 308 

Order of magnitude, 363, 364 

Order of operations, 34 ff. 

Ordered field, 553 

Ordered pair, 173, 244 ff., 372 

Ordered partition, 497, 498 

Ordering of real numbers, 146 ff. 

Ordinate, 295 

Origin, 2 

Output, 388, 398 

Overhead, 426 

Overlaps, 162 


Pair (see also Ordered pair), 140 

Pairing, 262 

Parabola, 275 ff., 300, 313 ff., 316 

Paradoxes of sets, 137 

Parallel, 19, 26, 49, 102, 205, 302 

Parameter, 22, 192 

Parametric equations of a line, 192 

Parametric representation, 192, 463 

Parentheses, 32 ff., 39 

Pareto distribution, 516 

Partition, 496 

Payoff, 508 

Peano postulates, 330 

Percent, 53 

Period, 236, 289, 290 

Periodic, 236, 289, 336 

Permutation, 325, 493, 564 

Permutation group, 564 

Perpendicular lines, 205 

Perpendicular vectors, 241 

Phase, 290 

Piecewise linear function, 269 | 

Plane geometry (see also specific 
topics), 208, 213 

Plane vector, 369 ff. 

Plausibility, 29, 41, 93, 394, 418 

Point, 1 ff., 63, 65, 175 ff., 486 

Point estimate, 539 

Poisson distribution, 511 

Polar axis, 220 


INDEX 


Polar coordinates, 220, 304, 470 

Polar distance, 220 

Polar form of. complex numbers, 
373 

Polar form of conic, 304 

Polygon, 338 

Polynomial, 297, 396, 403 

Population, 388, 461, 462, 493 

Positional system, 357 

Positive integers, 137, 147, 350 

Possible error, 363 

Postulate, 64 

Power function, 404, 534, 559 

Power of a test, 534 

Power set, 144, 259 

Powers, 34 

Precision, 361, 365 

Pressure, 273, 477 

Prime number, 103, 138 

Principal axis of hyperbola, 309 

Principal nth root, 352, 375 

Principal square root, 52 

Probability, 485 ff. 

Probability density, 514 

Product (see also Multiplication), 12, 
65, 247 

cartesian, 244 
of two cardinals, 328 

Profit, 276 

Projectile, 279, 416, 421, 465 

Projection of vectors, 241 

Proof, 29, 31, 45, 64, 88 ff., 102, 119, 
120, 208, 213 

Proper subset, 162 

Properties, 244 

Proportional, 271, 426 

Proposition, 21 

Propositional formula, 21 

Pure imaginary, 373 

Pythagorean theorem, 4, 45, 96 


Q.E.D., 94 | 
Quadrant, 175 7 
Quadratic equation, 107, 150, 277 
Quadratic formula, 108 
Quadratic function, 274, 282, 313 
Quantification and quantifiers, 26, 
117 ff., 126, 141, 394 
Quartic, 403 
Quintic, 403 
Quotation marks, 6 ff., 38, 117 
Quotient (see also Division), 67 


Radian, 218, 222 
Radical, 34, 352 
Radioactive materials, 461 
Random sample, 493 
Random variable, 507 
Range, 136, 250 
Rate of change, 379, 381, 398 
Ratio, 1 
Rational function, 297 
Rational numbers, 3, 60 ff., 187, 325, 
| 348, 355 
Rational root, 407 
Ray, 216 
Real number, 4, 63, 65 ff., 144, 146 ff., 
326, 354 ff., 356, 373 
Real part, 373 
Rearrangement, 27, 31 
Reciprocal, 65 
Reciprocal function, 296 
Rectangular coordinates, 220 
Rectangular form, 373 
Recurrence relation, 340, 408 
Recursive definition, 340, 571 
Reduced equation, 407 
Reflection, 284 
Reflexivity, 552 
Region, 211 
Relation, 115, 244 ff. 
Relations between sets, 160 ff., 267 
Relative complement, 154 
Relative error, 361, 477 
Relative frequency, 491 
Relative maximum and minimum, 
410 ff. 

Reliable, 361 
Remainder theorem, 406 
Repeating decimal, 60 ff. 
Replacement, 36 ff., 91 
Resolved vectors, 241 
Resultant, 229 
Revenue curve, 203 
Revolutions, 218 
Right-hand derivative, 465 
Right-hand limit, 465 
Rigid motions, 565 
Roots, 352, 375 

of equations, 19 

of unity, 375, 571 
Roster notation, 135, 255 
Rotation, 315, 318, 566 
Rounding off, 362, 367 
Row, 202 
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Rule 

of choice, 120 

heuristic, 130 

of induction, 336 

of inference, 92 

of replacement, 37 ff. 

of substitution, 39, 92, 478 
Rules of proof, 90 ff. 


Sample, 493 

Sample space, 486 

Sampling, 493 

Satisfy, 22, 176 

Scientific notation, 362 

Scope, 34 ff., 125 

Secant line, 378 

Second derivative, 412 

Sector, 217 

Segments, 197 

Semi-major and semi-minor axcs, 307 
Sentence, 21 ff., 24, 136, 176 
Sequence, 339 

Set, 134 ff., 263, 486 ff. 

Sets and sentences, 168 ff. 

Side of an angle, 217 

Side of an equation, 18 
Significant digit, 361, 363 
Significant substitute, 7 

Signum function, 270 

Similar sets, 322 

Simple interest, 272 

Simplifying, 31 

Simultaneous linear equations, 199 
Simultaneous sentences, 266 
Sine, 223 ff., 230, 258, 289 ff., 368 
Sine wave, 290 

Sines, law of, 289 

Singleton, 140 

Single-value function, 261 

Slope, 194, 205, 238, 398 
Slope-intereept form, 195 
Smooth, 403 

| Solution, 22, 176 

Solving equations, 19, 105 ff., 405 ff. 
Solving inequalities, 23 ff., 148, 166 
Solving sentences, 22 

Solving a triangle, 233 

Some, 118, 121, 169 

Special legislation, 141 
Specialization, 38 

Speed, 382 

Square, 47, 148 


Square root, 52, 58, 150, 259, 267 

Standard deviation, 510 

Standard form of an ellipse, 306 

Standard normal distribution, 517 

Standard position, 216, 217 

Statement, 21 

Statistical decision functions, 544 

Statistical hypothesis, 532 

Statistical independence, 503 

Statistical inference, 529 

Statistics, 493, 5380 

Step function, 270 

Stirling’s formula, 499 

Straight line, 190, 302 

Stroke, 89 

Strong law of large numbers, 524 

Subgroup, 569 

Subrelation, 265 

Subscripts, 7 

Subsets, 138, 143 ff., 486 

Substitution, 7, 37 ff., 40, 118, 115 
rule of, 92 

Subtraction, 12, 29, 57, 67, 327, 552 

Successive term, 339 

Successor, 330 

Successor function, 330 

Such that, 164 

Sufficient, 75, 108 

Sum (see also Addition), 10, 65 

Sum formulas, 234 

Sum of cardinals, 327 

summand, 344 

summations, 343, 448 

Supply, 199 

Switching circuits, 573 

Syllogism, law of, 100 

Syllogistic reasoning, 169 

Symbol, 6 

Symmetric formula, 564 

Symmetric group of degree n, 564 

Symmetries, 556 

Symmetry, 277, 312, 552, 560 
with respect to a line, 306 
with respect to a point, 310 

Synonymous, 18, 20, 36, 82, 105 

Systems of numeration, 357 


Tables, 224 ff., 259 

of antiderivatives, 432 
Tangent, 258 
Tangent function, 292, 368 
Tangent line, 379, 381, 398, 426 
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Tautology, 83, 98 

Temperature, 273 

Terminal point, 9 

Terminal side of angle, 217 

Terminating decimal, 59 

Term, 16, 47 

Tesselation, 561 

Testing, 530 

Then, 74 

Theorem, 64 

Theory of equations, 406 

Topology, 566 

Transfinite cardinal, 324, 329 

Transform, 261 

Transformation, 261, 563 

Transformation group, 562 ff. 

Transitivity, 552 

Translation, 314, 317, 566 

Transverse axis of hyperbola, 310 

“Trees,” 500 

Triangle, 48, 49 

Triangle inequality, 186 

Trigonometry (see also Circular func- 
tions), 233 

Triple, 140 

Truth set, 136, 246 

Truth table, 81 ff., 257 

Truth value, 71, 116, 167, 257 

Two-point form, 192 


Unary operation, 261, 572 
Unbiased estimate, 539 
Unbounded set, 355 
Uniform distribution, 511, 514 
Union of sets, 152, 265, 327 
Uniqueness, 65, 569 

Unit circle, 214 

Unit cost function, 427 
Unit distance, 2 

Unit point, 2 

Unit step function, 270 
Universal quantifier, 118 
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Universal relation, 264 
Universal set, 138 

Universe of discourse, 138 
Unknowns, 22 

Upper bound, 355 

Upper limit of integration, 446 
Upper sum, 440 


Vacuous set, 138 
Valid argument, 95 
Value, 6 ff., 16 
Value of a function, 261 
Variable, 7 ff., 16 
dependent, 256 
independent, 256 
quantified, 126 
Variance, 510, 539 
Varies directly, 271 
Vector, 9 ff., 65, 165, 180, 215, 559, 567 
addition, 10, 181 
division by a real number, 184 
length of, 186 
multiplication by a real number, 183 
Velocity, 282, 378, 425 
Venn diagrams, 152 
Verification, 132, 336 
Vertex 
of an angle, 217 
of cone, 300 
of a parabola, 276, 313 
Vertices of ellipse, 306 
Vertices of a hyperbola, 309 
Volume of gas, 273 


Wave motion, 290, 293 

Weak law of large numbers, 523 
Well-ordering, 338 

Work, 452, 477 


Zeno, 379 
Zero, 1, 30, 55, 65, 324 
of a function, 278 
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